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Part A 



Introduction 



Chapter 1 



About the Book 



In this first chapter, we summarize the content of the book to give the reader a broad 
perspective of what Industrial Organization (aka Industrial Economics) is about. Next, 
we discuss our methodology from several points of view and the prerequisites needed to 
follow the exposition. 

1.1 Foreword 

The first thing to do before reading a book about the organization of industries is to learn 
a little about the main sectors of economic activity in the advanced economies we aim to 
study. Figure 1.1 illustrates how services have gain the upper hand over manufacturing 
during last century in the US. 
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1980 



2000 



Figure 1.1: Goods vs. Services in the US (in % of GDP) 



Table 1.1 using data from EuroStat and the US BE A goes into finer detail to compare 
these major economies. 1@ Given the annual GDP of the EU, each %o represents 12bn€ of 
value creation or roughly 24bn€ of sales. Likewise for the US, each %o represents 14bn$ of 
value creation or roughly 28bn$ of sales. 2 ® 
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Table 1.1: Industry Shares of GDP (in %) 

1.2 Purpose 

Our aim in this book is best understood by looking at the historical development of the 
industrial organization field. According to the standard definition, the economic science 
studies the allocation of scarce resources among alternative uses to achieve desired objec- 
tives. Industrial Organization focuses on the firms that are involved in this process. 



On Competition 

The synonyms for competition found in a dictionary are rivalry contest, conflict, strife, 
struggle and quarrel. All convey negative connotations of waste, violence, compulsion, 
insanity and above all irrationality ; as such they can hardly be associated with the 
advancement of humanity. Yet, the competitive market is the Holy Grail of economists 
and serves as the benchmark for normative analysis (here and elsewhere). This paradox 
takes its root in the oblivion of the conditions surrounding the idea of economic competi- 
tion. 

It is only in a society where violence and coercion are checked by a benevolent but 
powerful State that the pursuit of selfish interest is canalized into hard-work, innovation 
and risk-taking instead of theft, fraud, deception or influence. Most of this book is set 
in this ideal world whose practical incarnation is the free market democracy (e.g., US, 
EU). However, we are lead to deal at the margin with non productive behaviors such as 



collusion, predation, exclusion or rent-seeking. It is therefore necessary to have a good 
understanding of how the free market democracy ever appeared and what is the current 
legal framework in these societies. The first part of this book undertakes this task by 
succinctly studying institutions, to wit the state apparatus (and all the actors revolving 
around it) and the legal framework bearing on firms. 

Tradition 

The economic theory tradition focuses on private property, free markets and the price 
system as means to produce and distribute goods and services. This institutional ar- 
rangement is backed formally by the general equilibrium theory and the welfare theo- 
rems; however reality often fails to conform to their preconditions and predictions. In- 
dustrial Organization is the field of economics that tries to address some of these discrep- 
ancies. To take just one obvious issue, we observe oligopoly, that is to say a few active 
firms in the vast majority of markets, instead of the large number of competing firms 
that perfect competition calls for. As a consequence, the rational minded entrepreneurs 
who run these few firms do not act as price-takers but strategically; they recognize their 
interdependence. The neat conclusions of the perfect competition model therefore cease 
to hold. 

Starting with Cournot (1838) and Marshall (1890), economists have developed a body 
of theory to address these inconsistencies; the paradigm that has been in vigor until the 
1970s is called structure conduct performance (SCP). 

Structure studies how far is the actual industrial activity from being a competitive one. 
One looks at the number and size of sellers and buyers in the various markets that 
form the production chain, at the differentiation of products and finally at the entry 
conditions. 

Conduct studies how firms compete when they cease to be price-takers (as a conse- 
quence of the non competitive structure). We are interested by all strategic behav- 
iors like collusion, differential pricing, advertising or R&D. 

Performance deals with the departure from the productive efficiency achieved in a 
competitive market. From a static point of view, we are interested by the effects 
of market power (profitability and cost efficiency), while the dynamic point of view 
raises the difficult question of the optimal pace of innovation. For instance, how 
long shall a patent protect the innovator to encourage innovation. 

A major implication of the SCP paradigm is to justify governmental interference in 
the economic life. Antitrust laws and regulations are passed with the aim of restoring the 
efficiency lost when firms take advantage of their market power. In the US, throughout 



the XX th century, legal activity has been intensive and enfored by the federal commis- 
sions and the supreme court. In Europe similar rules started to be enforced actively by 
the Commission in the 1970s. 

Critique 

The dominant SCP paradigm was challenged by the Chicago school of thought in the 
1960s on two grounds. Firstly, governmental intervention is based on loosely demon- 
strated market failures like the abuse of market power; the remedies proposed need not 
improve on the problem if for instance, pressure groups succeed to capture their regula- 
tor, thereby reinforcing their inefficient dominant position. Secondly, the SCP paradigm 
is entirely driven by price theory and anonymous market transactions, thus it totally ig- 
nores the complex contractual bilateral relations that exist inside firms, between firms or 
among firms and their providers, their clients or their regulator. For instance, a firm can 
voluntarily restrict the number of its suppliers to limit competition among them, it can 
set-up a complex pricing formula rather than a fixed per-unit price to screen customers, 
or an employer can offer its employees a wage schedule contingent on performance in 
order to give them stronger work incentives. The persistence of these practices must be 
explained and their degree of efficiency must be explored. 

The explanatory power of "price theory" for this endeavor is limited because it studies 
how actors interact inside a market, being forced to obey its rule i.e., they are allowed to 
choose only one variable like a price, a wage or a quantity. However, it is well known that 
firms juggle with many decisions at the same time, with issues that belong to different 
dimensions. Contract theory encompasses this complexity by extending the available 
choices of the actors and by explicitly considering other transaction mechanisms beyond 
the market one. At a higher level, institutional economics search for the factors that 
lead firms to adopt a different trading mechanism. For instance, why do firms integrate 
or why do government privatize formerly public industries. Transaction cost or property 
rights are among the proposed answers. These new insights, originally based on cogni- 
tive logic, have been integrated into the mainstream of microeconomics, a fact we wish 
to reflect here. 

The Chicago paradigm also takes a more dynamic view of economic affairs and argues 
that the inefficiency generated by market power is of a temporary nature and is bound 
to be eliminated by the entry of more innovative firms. It therefore calls for a sequential 
analysis that considers the competition for the market on top of the competition in the 
market, the idea of contestable markets. To summarize, the Chicago contention is that 
competition might be the worst economic system except all those other forms that have 
been tried previously in history (misquoting Winston Churchill on democracy). 



Synthesis 

Received Industrial Organization implicitly focused on market competition as the sole 
productive behavior conducive of Pareto improvements for society. Modern research has 
shown that direct contractual relationships (e.g., bilateral monopoly integration, agency 
theory) offer an alternative path for wealth creation. They have made their way into 10 
volumes but mostly as side dishes. We have tried to rethink the structure of the text in 
order to integrate this facet in a more coherent way 3@ 

Another recent avenue of microeconomics research we contemplate is unproductive 
rivalry over existing wealth. Organized contests (e.g., auctions, regulation) or sponta- 
neous conflicts (e.g., rent-seeking or patent races) are interactions mediated by either 
formal or informal rules. The large literature on this topic has yet to be incorporated 
into an 10 textbook in a systematic fashion; ours is a first intent. 

For the reasons exposed above, it is our belief that modern Industrial Organization 
theory ought to acknowledge the role of contracts, explicit as well as implicit. We thus 
take a Contract Based approach to Industrial Organization (hence the acronym IOCB). 
The SCP framework of analysis remains our guiding scheme but the study of conduct is 
extended from pricing behavior to contracting behavior and other forms of rivalry. 

We mostly adopt a positive view as we describe how competing firms operate in order 
to identify the extent of their market power. We then inquire the implications for all 
actors, including consumers or government. When possible, we use the consumer sur- 
plus and market welfare concept to make normative judgments. Thus beyond, trying to 
explain how firms acquire or maintain market power, we also seek to shed light on what 
policy makers can do about it. The notable absence in this book are empirical studies of 
10; a good starting point is Einav and Levin (2010). 

Regarding the style adopted for the crafting of this text, two features are worth ex- 
plaining. Contrariwise to the ahistorical perspective of most economics monographs, we 
have tried to associate each theory or concept with its original author in order to link 
the appearance of novel ideas to their epoch. We believe that political and technological 
developments in the advanced economies where these scientists lived had a definite in- 
fluence over what to study and how to approach issues. In a sense, economic theory is a 
lively science very much related to the real economy, rather than an ordeal escaped from 
an ivory tower. 

The other novel feature of this book is technical as we extensively cross-reference 
sections but also apparently unrelated topics who nevertheless share the same basic 
model or derive from the same fundamental economic problem. Thus, only the original 
model is fully developed while later applications are more succinctly presented. Our 
desire here is to illuminate how a few bare bones support a wealth of theories. This also 



allows the reader to quickly find all the sections where a concept is being used. 

1.3 Summary 

As the adage goes, a picture is worth a thousand words. We thus try with Figure 1.2 to 
convey a graphical overview of the topics we approach in this book. The 8 gray boxes 
indicate Parts while the other boxes more or less refer to the chapters. Locations on the 
chart do not respect truthfully the content, rather they loosely reflect connexions with 
each other. 
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After this introductory chapter, we recall the basics of the perfect competition paradigm 
and present the normative concept of welfare used throughout the book to judge and com- 
pare economic outcomes (ch. 2). We also review quickly some elements of game theory. 

Market power is the ability to raise the market price of its product without losing all 
customers to competitors. We delve into this issue by focusing on the least competitive 
market structure where stands a unique seller, the monopoly. We characterize the op- 
timal conduct and the resulting allocative inefficiency of the monopoly (ch. 3). We then 
study differential pricing, the wide range of discrimination strategies that a firm can 
adopt to take advantage of her market power (ch. 4). 

An oligopoly is a market structure where a limited number of firms with market 
power compete among themselves, realizing their interdependence when taking strategic 
decisions on price, output and quality. We introduce the concept of strategic behavior 



and the game theoretical concept of Nash equilibrium to study the determinants of this 
competition in a variety of situations (ch. 5). On top of being empirically relevant, the 
oligopoly is also the theoretical long term outcome of a perfectly competitive market 
where entry is only limited by the existence of fixed costs. We thus examine this entry 
dynamic as well as other strategic decisions that firms can take to improve their lot (ch. 
6). Lastly we complement market driven rivalry with pure economic rivalry (ch. 7). 4@ 
We offer a variety of models dealing with conflicts, contests, patent races, rent-seeking, 
political pressure. 

Armed with the previous theories and their predictions, we take on anti-competitive 
behavior. At the root is the fact that competition harms profits so that a firm may ratio- 
nally decide to adopt practices harmful for society in order to obtain or defend a profitable 
incumbent position. Yet, fair and undistorted competition is a cornerstone of any market 
economy. We thus present the playing field upon which our theories are constructed, that 
is to say the legal framework of the European Union and of the United States of America 
(ch. 8). Their antitrust policies pursue the goal of defending and developing effective 
competition. The next two chapters are devoted to modeling potentially anti-competitive 
behaviors. We first deal with the elimination of concerted practices such as cartel, collu- 
sion or restrictive agreements (ch. 9). We explain why they reduce welfare and how they 
are combated. Then we look at abuse of their dominant position when firms try to block 
entry or force exit of challengers with limit pricing, preemption, predation, foreclosure 
or attrition (ch. 10). 

The next part focuses on non-price strategies whereby firms seek alternative ways to 
relax competition. Differentiation and innovation aim to make a product nearer to con- 
sumers wants so as to make it relatively more attractive than competing proposals and 
thereby obtain a lawfully exploitable market power. We first study how competition is 
affected when products are differentiated (ch. 11). The vertical or quality differentiation 
amounts to design a product to make it more attractive to all potential buyers while hor- 
izontal or characteristics differentiation amounts to expand a basic product into a whole 
range to address the heterogeneity of customers. We also treat advertising which is to 
make one's product known to all, and branding which is to associate the product with a 
particular group of consumers. Lastly, chapter 12 deals with innovation understood as 
a broad concept, of either a technical or subjective nature. Firms invest in research and 
development (R&D) to develop new products, new features or new ways of doing things 
(faster or cheaper). We look at the framework where innovation takes place, then study 
the incentives for firms to invest in that activity; we end with a very important byproduct 
of the innovation process is its legal protection which is treated in §12.3 on intellectual 
property rights (IPRs). 



The part dedicated to integration gathers all the theories dealing with the expansion 
(and contraction) of firms, how they grow from handicraft to conglomerates via merger 
and acquisition or why they suddenly decide to concentrate on core activities and home 
markets by divesting unwanted units. Stated in other words, we try to identify the 
boundaries of firms. It is customary in economics to distinguish vertical and horizontal 
integration i.e., whether the two firms about to merge make complementary or substi- 
tute products. In the first case, the decision to acquire a provider or a distributor is re- 
lated to the age-old "make or buy" quandary: what are the transactions more efficiently 
conducted inside a firm, on a market or face-to-face with another firm? Horizontal in- 
tegration regards mergers involving similar firms who look to reach a "critical mass" 
and take advantage of scale economies and complementarity w.r.t. geography, brands, or 
segments. 

Both dimensions of integration are therefore treated together in this part. We define 
a firm and how it works, before presenting the various economic theories of the firm as 
well as a sketch of contract theory (ch. 13). Then, we tackle issue of vertical integration 
such as the bilateral monopoly, specific investment and the hold-up problem (ch. 14). We 
also compare formally the so-called transaction cost and property rights theories of the 
firm. Lastly, we treat horizontal integration by presenting the numerical methods used 
by antitrust authorities to decide whether competition is endangered or not by a merger 
(ch. 15). 

One of the novelty in this book is to dedicate a part to public oversight. The state 
(ch. 16) is the superstructure gathering all the organizations in charge of mediating the 
economic activity; it includes the legislative, judicial and executive bodies and within 
the latter, the government, its bureaucracy and regulatory agencies. We seek to explain 
its dual role as judge and party and how its origin impinges on today's legal framework. 
We present succinctly the State and look at the various rationalizations for its encroach- 
ment upon economic life. We then look at the related activities of rent-seeking and the 
liberalization trend of the last decades. The next chapter is devoted to regulation, the 
direct oversight of the State upon markets thought to suffer from failures such as the ex- 
ercise of market power (ch. 17). We end this part with a synthetic treatment of natural 
resources as these markets display market power and externalities, and thus stand as 
perfect candidates for public oversight (ch. 18). 

This next part introduces the reader to information economics; it builds on incentives 
and asymmetric information to fruitfully complement the traditional price theory ap- 
proach of Industrial Organization. This part is the largest in the book due to the wealth 
of material to be exposed and the necessity to introduce some background first. As in the 
previous part, we relinquish the market playing field to focus on the firm itself. 



Our first task is to update microeconomics concepts regarding risk and uncertainty 
where imperfect information is added to the standard microeconomic theory (ch. 19). 
We then focus on incentives and hidden action known as moral hazard (ch. 20) before 
moving more specifically to asymmetric information with the concept of hidden infor- 
mation known as adverse selection (ch. 21). We then give an account of auctions (ch. 
22), a competitive trading mechanism used to extract or reveal information; it extends 
adverse selection. Lastly, we encroach upon the field of corporate finance to examine 
how these asymmetric information phenomena impinge upon the financing of an en- 
trepreneur-owned firm (ch. 23). 

We wind up the book with network industries, characterized by network externalities, 
whether on the demand side or on the supply side. The study of externalities does not be- 
long per-se to the field of Industrial Organization but their presence in many oligopolistic 
or regulated industries shape the strategies of firms and regulators. The technological 
features giving rise to externalities are quite different according to whether we study 
goods or services so that different chapters are called for. Chapter 24 on standards and 
components deals with oligopolistic markets for goods where a positive externality is gen- 
erated by the demand side. The main issue for the competing firms is to decide on their 
degree of differentiation; they can either make products compatible with their's chal- 
lengers or not. Chapter 25 deals with service industries relying on a physical networks 
e.g., transportation, energy or communications. Both positive and negative externalities 
are present here. The positive one comes from the supply side as the technology displays 
increasing returns to scale (natural monopoly). The negative network externality is the 
congestion created by excessive use. This singularity requires a specific market design 
since a competitive market would never emerge. 

1.4 Methodology 

We start with the methodology of economic science before narrowing our scope to the 
topic of this book and conclude with the prerequisites for a fruitful reading. Readers 
accustomed with economic methodology may want to skip this section; we nevertheless 
invite them to (re)read the following timeless quotations. 

Knight (1933) There is no more important prerequisite to clear thinking in regard to 
economics itself than is recognition of its limited place among human interests at 
large. 

Pareto (1897) Economic science does not attempt to establish any particular method 
of economic organization, and it is not the business of science to do so. Science 



does, however, attempts to solve problems of the following kind: (1) What are the 
effects of a regime of free competition? (2) What are those of a regime of monopoly? 
(3) Those of a collectivist regime? All these questions must, of course, be treated, 
not from a polemical point of view, but solely for the purpose of ascertaining what 
results would follow upon their installation. It is especially necessary for us to 
discover what relation these results bear to the aggregate well being of humanity. 

Poincare (1905) Science is facts; just as houses are made of stones, so is science made 
of facts; but a pile of stones is not a house and a collection of facts is not necessarily 
science. 

Solow (1956) All theory depends on assumptions which are not quite true. That is 
what makes it theory. The art of successful theorizing is to make the inevitable 
simplifying assumptions in such a way that the final results are not very sensitive. 
A "crucial" assumption is one on which the conclusions do depend sensitively, and 
it is important that crucial assumptions be reasonably realistic. When the results 
of a theory seem to flow specifically from a special crucial assumption, then if the 
assumption is dubious, the results are suspect. 

Dasgupta (2002) Critics of modern Economics take refuge in such aphorisms as that 
"it is better to be vaguely right than precisely wrong". What this misses, however, is 
that you won't even know if you are vaguely right if you operate within a framework 
in which you cannot be precisely wrong; there is no way to controvert a vague 
statement. 



1.4.1 Scientific Methodology of Economics 

Social sciences are often called the "soft" sciences in opposition to the "hard" sciences 
formed by physic, chemistry, biology and all their derivatives. Economic theorists like 
to see their field as the hardest of the soft sciences because it has adopted the modus 
operandi of pure science, namely the construction of theories and models that are worked 
out using the hypothetico-deductive method (cf. Poincare (1905)). It is worthwhile notic- 
ing that most of the authors who contributed the material presented in this book were 
trained as engineers or mathematicians before turning to economics; their achievements 
based on the use of advanced mathematics have cleared questions that intuition alone 
could not dominate. 

An economic model is a reflection upon past observations and experiences aiming at 
understanding the existing relationships among economic agents or between different 
situations. It tries to single out principles or laws that could apply to a class of prob- 
lems; 5 *^ this way we can anticipate the outcome of new circumstances as they appear or 



propose policy changes that are likely to generate a movement towards a more (socially) 
desirable situation. Practically an economic model should be based on assumptions that 
we derive from the observed reality taking care to select the most relevant ones leaving 
aside details. There should be enough components (bare bones) to enable the derivation 
of non trivial results but not too much to avoid inextricable complications. 6 ® 

The analysis of the model is the step where economics enters the domain of logic. The 
implications we derive may confirm or invalidate our original views; in the latter case 
the analysis helps us to comprehend what and where something got wrong, whether a 
false assumption or an invalid idea. The interaction between ideas and models is thus 
bidirectional; it is a process of trial and error that permits to devise models that come 
always closer to explaining the real situation. Lastly whether a model turns out to be 
useful depend on how and why we use it. Computing the trajectory of a tennis ball is 
quite different from that of a satellite; it should therefore not be used as a proxy of the 
satellite's trajectory because one would almost surely lose the satellite. 

Our approach in this book is to bring forth in the simplest way theories and results 
that, after being challenged and tested by academics and practitioners, have passed the 
test of time. We mostly draw on the recent literature who authors aim at uncovering the 
simplest assumptions needed to generate interesting conclusions. We expose them with- 
out excessive mathematical rigor in order to devote more space to motivation, historical 
cases, relevance and above all economic intuition. 

1.4.2 Individual Rationality 

Thesis 

As we already explained in the book's summary, our fundamental tool is game theory; 7 ® 
its use is warranted by our adherence to the principle of individual rationality which 
is almost universal in modern economics and derives itself from the wider philosophical 
doctrine of Methodological Individualism. The basic tenet of the latter is that all collec- 
tive phenomena are the outcome of the actions and interactions of individual decision 
makers as well as of the traditions created and preserved by them. 

Individual rationality states that people make rational choices not foolish ones. More 
precisely, a decision-maker chooses the best action according to her preferences, among 
all the actions available to her. No qualitative restriction is placed on the decision- 
maker's preferences; her "rationality" lies in the consistency of her decisions when faced 
with different sets of available actions, not in the nature of her likes and dislikes. An 
economic decision maker is thus seen as a different breed of human called Homo Eco- 
nomicus whose main, if not unique, motivation is greed. 



Antithesis 

Everyone will agree with Leijonhufvud (1993) that economic theory describes the behav- 
ior of incredibly smart people in unbelievably simple situations whereas reality is more 
one of believably simple people coping with incredibly complex situations. Over the last 
decades, the study of the human brain has shown that people's mental limitations rule 
out the perfect rationality embodied in "Homo Economicus". This is because many of our 
decisions and behavioral responses are in fact reflexive and emotional (we have no time 
to think), thus bound to be ridden with mistakes. Yet, the learning process we experi- 
ence through life is precisely aimed at integrating useful and valuable reflexes to avoid 
future losses. Thus, our mistake-ridden decision processes are, on balance, efficient and 
welfare enhancing. This aspect is further enhanced when individual behavior is aggre- 
gated among numerous people; it does not matter if a few people make mistake when 
the vast majority follows a rational path, the overall pattern of behavior will be almost 
rational. In a nutshell, the interaction of "ordinary" people in markets tends to produce 
"incredibly smart" results. 

An important factor, as far as this book is concerned, is that the decision maker is 
not a lonely human being but a collectivity known as the firm. Thus, although perfect 
rationality poorly matches human behavior, it does a better job when it comes to firms 
and organizations. In the same vein, many empirical irrationalities are macro phenom- 
ena with a rational micro foundation i.e., people act rationally given the informative and 
processing constraints they face, but market or government failures aggregate their indi- 
vidual behavior in an inappropriate manner. Most of these instances share an analytical 
similarity with the prisoner dilemma whose solution occupies a prominent place in this 
work. 

The obsession of "Homo Economicus" with money is another source of criticism. A 
wide range of experiments based on behavioral game theory have shown that honesty, 
integrity, intrinsic job satisfaction, and peer recognition are powerful motivators, and 
lead to better results for contracting parties than reliance on financial incentives alone. 
Hence, we rationally use a wide variety of medium beyond money to interact in society. 
It must be understood though that other motivations are not denied or deemed incon- 
sequential; rather, they are ignored for the time being to allow analytical progress. In 
a sense, this happens because economists have themselves limited intellectual faculties 
and are thus forced to simplify their theoretical models. In this respect, economics is no 
different than other sciences and over the last decades, it has started integrating the full 
range of human motivations within the basic staple of rational choice. 8@ We echo this 
effort at some points in the book. 

Next "Homo Economicus" is too polite because for him "a handshake is a handshake". 



More generally, the standard framework of neoclassical economics is a dream world dis- 
connected from reality. In Walrasisan models, agents are anonymous because they trans- 
act at no cost on markets, not with actual people. There is no hierarchy in the firm i.e., 
capital may hire labor or the reverse. Firms have no power as they only produce to 
serve the wants of consumers using the price signals transmitted by markets. Similarly, 
people have only purchasing power. Life as we know it is far bleaker and full of power 
play. As recalled by Williamson (1985) (p. 51), modern economics must go beyond market 
exchange where one only seeks to buy cheap and sell dear to include "the full set of ex 
ante and ex post efforts to lie, cheat, steal, mislead, disguise, obfuscate, feign, distort 
and confuse". This novel aspect is fully dealt with in Chapter 7 and more generally, the 
previously mentioned weaknesses are tackled along the book. 

Synthesis 

In view of the stream of criticism, one may wonder why has the "Homo Economicus" 
paradigm been so fruitful for economic analysis and so lonely. One possible answer is 
to observe with Myerson (1999) that no one has yet developed an accurate and tractable 
theory of the inconsistency and foolishness in human behavior; so our best analytical 
models are based on the rationality assumption for lack of any better foundation. But 
a more compelling answer starts by recognizing that social science is not about predict- 
ing human behavior, but about analyzing social institutions and evaluate proposals to 
reform them. 

When looking for potential flaws in a social institution like the senate of a democratic 
country, it is helpful to assume that the agents interacting in the institution, the sen- 
ators, are not themselves flawed (or insane). Otherwise, when citizens complain about 
the institution, we cannot say whether the institution should be reformed or whether 
senators should get better education! To do any kind of analytical social theory, we must 
describe institutions and predict the individual's likely behavior inside these institu- 
tions. To handle normative questions (what should be done), there must also be some 
concept of human welfare in our model. If we assume that some individuals are not 
motivated to maximize their own welfare (as measured in our model) or that some in- 
dividuals do not understand their environment (as predicted in our analysis), then any 
loss of welfare that we find in our analysis can be blamed on such dysfunctional or mis- 
informed individual behavior, rather than on the structure of social institutions. This 
means that a defense to reform social institutions (rather than reeducating individuals) 
is most persuasive when based on a model which assumes that individuals intelligently 
understand their environment and rationally act to maximize their own welfare. Only in 
that case can they complain about the poor performance of their institutions in a manner 



that is meaningful. 

1.4.3 Prerequisites 

This textbook intends to provide a complete panorama of Industrial Organization build- 
ing on many separate modules forming, we hope, a coherent whole (cf Figure 1.2). The 
prerequisites for a successful reading are low but not nil; some knowledge of microe- 
conomics^ will help the reader to grasp 10 ideas faster. For completeness, chap. 2 
provides a quick review of all the useful concepts. Next, we assume that actors (firms, 
managers, consumers, governments) are rational so that their interactions can be ana- 
lyzed by the models of game theory. To avoid duplication with excellent material already 
published, we only offer a minimal introduction in §2.4. 10@ 

The conciseness achieved in this book has been made possible by recurring to a cer- 
tain amount of (mathematical) formalism, in an amount sensibly greater than for simi- 
lar economics manuals. We therefore assume some acquaintance with mathematics. The 
reader should know a minimum of functional analysis and linear algebra e.g., the con- 
cepts of function, derivative, maximization and linear systems of equations. 11 ® It is the 
author's belief that mathematical formulae ought to be kept to a minimum in economic 
textbooks; as a consequence most space is devoted to intuitions, informal theories and 
examples. The other side of the coin is a certain dryness in the formalization but it ought 
not be a difficulty, neither for scientifically trained readers nor for economic students who 
have already seen the basic tools of optimization in economics. 

Mathematical Optimization 

The one fundamental tool the reader should keep in mind is exposed on Figure 1.3. In a 
typical economic problem, an agent cares for some dimension y such as profit or utility 
that is determined by some variable x such as quantity, price or quality through the 
relationship y = f{x); it is shown identically over the four panels. The unconstrained 
maximum of / is achieved at xq, the solution to the first order condition f'{x) = (FOC) 
i.e., when the curve becomes flat. This mathematical optimum is NOT the economic 
optimum, it is only a candidate because in most situations, the decision maker faces 
restrictions such as x > X\ and x < x 2 that arise from his interaction with other economic 
agents or with the market. 

On panel O, these conditions define two stripped areas of allowed values; their inter- 
section, the shaded area, is the economic domain from which the decision maker can pick. 
Since x belongs to this domain, the economic optimum is x* = x . On panels © and ©, 
the economic domain does not contain the mathematical optimum so that the economic 
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Figure 1.3: Mathematical Optimization 



optimum is a corner solution, respectively the maximum and minimum allowed levels 
for the endogenous variable (x* = %2 resp. x\). Lastly, panel © displays the case where 
the restrictions are incompatible among themselves and leave no way for the agent to 
make a pick. His optimization problem, in that case, has no solution. In retrospect, we 
may say that the economic problem was badly formulated. 



Chapter 2 



Microeconomic Foundations 



This chapter recalls some key results of microeconomic theory and game theory. The 
concepts of marginal cost and willingness to pay are basic tools for the analysis of firms 
behavior while the efficiency of the competitive equilibrium will stand as a benchmark 
for our (limited) normative analysis (cf §2.3.3 for a discussion). 

2.1 Supply 

The neo-classical theory of supply encompasses the notions of production, cost, competi- 
tive supply and some miscellaneous properties relating to the multiplicity of plants and 
products. 

2.1.1 Equi-marginal Principle 

The equi-marginal principle (aka Gossen (1854)'s second law) is a staple of neoclassical 
economics worth recalling here. In many static optimization problems, a rational deci- 
sion maker must allocate scarce resources over a variety of employments. The "manna" 
trick is a procedure or algorithm that repeatedly applies arbitrage to derive an equi- 
marginal principle as a property of an optimal allocation. It works as follows: 

• Compute the value of a small gift when allocated to a specific employment 

• Compare these values for all possible employments and allocate the gift to that with the 
highest value. 

• Observe that the marginal value of the winner employment has decreased 

• Prove that the wedge between winner and forerunner is reduced 

• Divide a large gift into a series of smaller ones 

• Reason that incremental allocation will bring all marginal values in line 

• Conclusion: at the optimum, marginal values in all employments are equal 



2.1.2 Production 

In this section, we look at technology per-se and productive efficiency. 

Technology 

The neoclassical theory reduces a firm to a single, rational minded, economic agent, the 
entrepreneur. He/she owns or knows a production technology completely and uniquely 
characterized by the maximal quantity 0(X,L) that can be produced by using K units of 
capital measured in €'s and L units of labor measured in hours of unqualified work. 1@ 

The productivity of the input factor K (resp. L) is defined as O^ = |^ (resp. O^), it is 
positive and often assumed decreasing with the input (the quantity of the other input 
being fixed). An isoquant is the locus of capital/labor pairs {K,L) such that production 
0(X,L) remains constant. On the isoquant, total differentiation 2 @ yields the marginal 
rate of technical substitution (MRTS): 

-*U55 (2.1) 

dK ® L 

which values one unit of K in units of L; it is the subjective 3 ® price of K (in units of L). 

The technology exhibits constant returns to scale (CRS), decreasing returns to scale 
(DRS) or increasing returns to scale (IRS) if an increase of A% of the factors generate 
an increase of production equal, lesser or superior to A%. In the presence of DRS, sev- 
eral small production units are call for in order to achieve productive efficiency. On the 
contrary, all inputs should be grouped into a single production unit under IRS (cf check 
when each applies). 

A frequently used technology is Cobb and Douglas (1928)'s ®{K,L) = AK a L^ with a,$ > 
0. Notice that the A parameter can always be normalized to unity by a convenient choice 
of physical unit of production (but then the price cannot be treated likewise). There 
are DRS if the sum a + /3 is lesser than unity, IRS if greater and CRS if equal. Notice 
that when the technology parameters satisfy < a,f5 < 1, we have % and <$kk < which 
corresponds to the usual assumption of "decreasing marginal productivity". However, 
the sum a + /3 can be anywhere between zero and 2, thus any kind of returns to scale are 
possible; this is so because the complementarity between the inputs mutually reinforce 
their productivity although each by itself displays DRS. 

Productive Efficiency 

Productivity Analysis is one of the most important econometric instrument at the dispo- 
sition of public authorities to improve the effectiveness of an industry both in terms of 



costs and quality of service. As we shall later comment in §17, regulators use produc- 
tivity analysis to implement yardstick competition among regulated firms belonging to 
the same sector or industry. We present below the intuition underlying the theory origi- 
nally developed by Farrell (1957) to judge in a practical way of the productive efficiency 
of (comparable) firms. 4@ 

Consider a good or service and the "state of the art" technology to produce it. The 
minimal combinations of factors (inputs) K and L necessary to produce an output Q form 
an efficient isoquant displayed on the left panel of Figure 2.1. Any firm in the industry 
will have to use a larger combination like A to reach the same goal; thus we can speak of 
the ratio ^ £ [0; 1] as the technical efficiency of A. If factor prices are such that the cost 
minimizing combination is C (tangency between cost line and isoquant) then the ratio 
jjg £ [0;1] can be interpreted as the price efficiency of the factors proportion used in A 
(and B). Now, the product ratio ^ £ [0;1] can be meaningfully regarded as an index of 
productive efficiency obtained by multiplication of the previously defined indexes. If the 
technology under scrutiny has constant returns to scale then the ratios of two different 
firms can be compared although the size of firms (their output) are different. 



AK 





One firm Comparing firms 

Figure 2.1: Productive Efficiency 



The previous construction is very elegant but hinges on some "state of the art" tech- 
nology that even a careful engineering study would fail to adequately identify; an easy 
way out of this awkwardness is to build an empirical production function based on the 
best results observed in practice. Since firms sales are different, the quantities of factors 
they use have to be scaled by total output value (measured by sales) so that the axes on 
the right panel of Figure 2.1 are "inputs per unit of output". Plotting together the com- 
binations of factors used by different firms is meaningful if we keep assuming constant 
returns to scale. The lower contour of the scatter of points defines the empirical efficient 
isoquant (for one unit of output); firms on that curve (dots) are deemed efficient while 



the others (diamonds) are deemed inefficient. For one of them such as A, an algorithm 
identifies the nearest efficient points B and C and use the ray to the origin to construct 
D, a blend of B and C, which uses the factors in the same proportion as A; the index of 
efficiency is as before ^ e [0; 1]. 

2.1.3 Cost 

In this section, we review the notions of fixed cost, marginal cost, impact of long-run and 
economies of scale. 

Fixed Cost 

To start its productive activity, a firm has to incur a start-up costs F including expen- 
ditures on Research and Development (R&D), a government license, organizing a mar- 
keting department and most importantly building or acquiring facilities for production, 
transportation, communications and distribution. 

Accounting and economics differ in their treatment of the cost of using an asset for 
production in two fundamental aspect. Firstly, the opportunity cost of the asset is the 
net revenue from the best alternative use (i.e., pursuing another activity with this asset). 
Secondly, part of the fixed costs are sunk cost in the sense that they have been committed 
and cannot be recovered. The reason is that some activities require specialized assets 
that cannot readily be diverted to other uses. This applies to the capital assets listed 
above. Examples are investments in machines which can only produce a specific item, 
the development of products for specific customers, advertising expenditures and R&D 
expenditures. For economic reasoning, those sunk cost are not taken into account to take 
future decisions i.e., we adhere to the proverb "let bygones be bygones". Accounting, on 
the other hand, keeps them in the books. 

Another important characteristic of a fixed cost is its indivisibility. Most industrial 
projects require an investment of a minimum size in which case the fixed cost is said to 
be lumpy. Outsourcing (using subcontractors) is a way to develop an activity without 
bearing its associated fixed costs but at an obviously higher marginal cost. 5@ 

Marginal Cost 

The marginal cost is the cost of producing an additional unit while the average cost is 
the "typical" cost of any of the units that have been produced so far. To define precisely 
these concepts we first go trough the derivation of the cost function. 



The neoclassical approach to costs uses the production function and the existence of 
perfectly competitive markets for inputs, mainly capital and labour. Given the prices r 
and w of the inputs K and L, the cost of producing a quantity q is 

C{q) = mmF + rK+wL s.t. <&{K,L)>q 

K,L 

Notice first that because O is increasing with both inputs, the constraint must be 
an equality at the solution (otherwise it would be possible to reduce input expenditure 
a little and still meet the production requirement). Our second observation is that the 
optimal capital/labor mix equalizes the subjective relative price -jp of Kin units of L to the 
objective relative price -^ observed on the market. If it was not so, one would be able to 
substitute a little of the relatively expensive input by the cheaper one while maintaining 
the production constant; total cost would then be reduced. 

If we apply these considerations to the Cobb and Douglas (1928) technology ®{K,L) = 

K a L.P then |r = f|f an d we deduce L = K-^ which can be substituted into the production 

i p 

constraint <&{K,L) = q to yield the conditional capital demand K* = q a+ f i [jf) a +v . Reporting 

this value into the FOC, we obtain the conditional labour demand L* and finally 

C{q) = rK* + wL* = (l + |) rK* = (cir a w p q) ^ (2.2) 

where Q. = a^/5~^ + a~ a /3 a (the manual derivation of these formula is a lengthy but re- 
warding exercise). 

The key concept of marginal cost is defined as the derivative of the cost function: 

C m {q) = — — 
dq 

and is related to the optimal factor mix condition by 

-Z— — "T - — C m \2,.o) 

We have thus obtained an equi-marginal principle (cf §2.1.1) stating that whenever 
two factors like machines, building, land, skilled or unskilled workers have a degree of 
substitutability, their market compensated marginal productivities have to be equalized 
(to \IC m ). Reworking (2.3) as y = ^, we conclude that every firm, whatever the good she 
produces, will equate her MRTS of any two factors to the ratio of their relative prices. 
Observe lastly that total cost of producing some quantity q is the sum of the fixed cost F 



and the (marginal) cost of all units, from first to last; in formulas: 

C(q) = F+ £ C m (x)=F+ [ C m (x)dx 

x=l JO 



(2.4) 



Short vs. Long Term Cost 

In the short-term, the current level of capital K cannot be adjusted so that only labour L 
remains flexible to meet the production objective q = F{K,L). The short-term cost function 
is therefore greater than the long-term one since a degree of freedom in the search for 
cost minimizing inputs is lost. However, when the long-term demand for capital K* is 
exactly equal to the short-term availability K , the short-term cost is equal to the long- 
run one; this occurs for only one production level q because the demand for capital K* is 
increasing in q. At this point, short-term and long-term marginal cost are also equal. 6g 

In our Cobb-Douglas example, the long-term cost function is given by (2.2); the short- 

i — 
term demand for labour is L = q P K P (we isolate L from the production equation q = K"L,P) 

it 1 
so that the short-term cost function is C st {q) = F + rK + wK p qP . Figure 2.2 plots both 

cost and marginal cost functions for the case where a = /3 = \ and F = 0; we have C st {q) = 

rK + ^-, C^ = ^, C lt (q) = Isfrwq, C% = Isfrw and q = 2\frTwK . 

k marginal cost 





Figure 2.2: Short vs. Long Term Costs 



Scale Economies 

Scale economies capture the idea that duplicating production does not duplicate cost, 
hence production ought to take place in a single plant instead of several in order to 
reduce total cost. The formal condition is C{Xq) < AC{q) for any A > 1 (the verbal definition 
uses X = 2). It is simple algebraic exercise to check that economies of scale are exactly 
equivalent to the property C m < AC which is itself equivalent to a decreasing average 
cost (cf Fig. 2.3). 7@ Diseconomies of scale correspond to the reverse situation (C m > 



AC o C{Xq) > XC[q) for any A > 1) and tell us that production ought to be divided among 
many small identical plants. 

Examples of scale economies include most activities involving a physical transporta- 
tion network. Meanwhile the network capacity is not exhausted, the marginal cost is 
more or less constant c; the average cost is then decreasing as the fixed cost of the net- 
work is passed upon more and more units. 

For most activities though, the initial range of output displays scale economies (Cm \), 
then constant returns to scale (C m cte) over a range of normal business and finally dis- 
economies of scale (Cm /) when output overshoots the plant's capacity. This gives rise to 
Edgeworth (1913)'s celebrated U-shaped average cost curve which crosses the marginal 
cost curve at its minimum, the minimum efficient scale (cf §2.1.4). 

2.1.4 Competitive Supply 

In this section, we first recall the definition of a competitive market before looking at the 
optimal production of a competitive firm and its competitive supply. 

Competitive Market 

Edgeworth (1881) defines & perfectly competitive market by four conditions: 

• HI: Homogeneous goods are for sale 

• H2: Large number of buyers and sellers 

• H3: Perfect price information for buyer and sellers 

• H4:No barrier to entry for potential buyers or sellers (cf. §10.2) 

An equilibrium of this market is a situation where everyone performs the trade she 
wishes given the prices she observes and the decisions taken by other agents. Conditions 
HI and H3 imply a unique price in equilibrium because every buyer can see the lowest 
price between several identical goods and therefore buys the cheapest. Condition H2 
implies that no one has market power i.e., can credibly threaten anyone to buy at a lower 
price or to sell at a larger price, hence the price is taken by everybody as given. With 
this price-taking behavior the equilibrium price equates demand and supply. As long as 
profits are positive on this market, H4 enables entry of economic agents seeking a better 
remuneration than elsewhere, thus in the long run sellers profits are nil. 8@ 

Optimal Production 

A firm participating in a competitive market completely lacks market power and ratio- 
nally adopts the price-taking behavior. The price p becomes a parameter imposed by the 



market while the supplied quantity q is the only variable to be chosen by the firm. Using 
the cost decomposition (2.4), the profit reads 



n{p,q) = qp-C(q) = -F + £ p-C m {x) 



(2.5) 



x=l 



and we define the summation term as the producer surplus (equivalently, it is profits 
nets of fixed cost i.e., n{p,q) +F). The difference p-C m {x) is the profit derived from unit 
#x, it is called the marginal profit. The optimal production q, maximizing profit n (or 
producer surplus), solves 

p = C m (q) (2.6) 

and is intuitively derived from (2.5) by noticing that profit increases with additional 
production meanwhile the marginal profit p - C m {x) remains positive. As can be grasped 
on Figure 2.3, the competitive supply curve is just the marginal cost curve but since we 
are looking for a quantity as a function of price, the exact formula is the inverse function 
i.e., s{p) = C^(p). 




q = s(p) 



Figure 2.3: Individual Supply 

The Cobb-Douglas technology q = K a L,P provides an easy application in the present 
case of decreasing returns to scale (a + /3 < 1). We previously derived the marginal cost 

1 (" 1 —11 

function C m {q) = \q a+ P for some A > (cf eq. 2.2); the competitive supply is thus its 

g+/3 

inverse s{p) = [Xp] la ^ . 

At the optimum q = s{p), firm profit becomes a sole function of the market price U{p) = 
n(p,s(p)). 9@ The producer surplus is then equal to the triangle area between the price 
and the marginal cost curve but beware that this is true only when the quantity is the 
competitive supply. 



Optimal Behavior 

The optimal quantity s{p) has been derived as the solution of the "p = C m " equation 
which implicitly assumed that the firm was going to be active i.e., produce at least one 
unit. More formally we have maximized the producer surplus which differs from profit 
by the fixed cost. Whenever the latter is very large, the truly optimal strategy is to shut 
down temporarily because the maximum producer surplus is lesser than the fixed cost 
which can be avoided by not starting production (recall the convention we adopted to 
remove any sunk cost from the fixed cost). This situation is characterized by U{p) < 0. 

The average cost AC{q) = —£- is useful to account for fixed cost and check whether 
producing something is the optimal policy. Indeed, profit can be factorized by production 
into n{p, q) = q[p- AC{q)) so that positive profits are obviously equivalent to average cost 
below the market price. It is a simple exercise to check that the minimizer q of AC 
satisfies C m {q) = AC{q) = p; w@ this quantity is called the minimum efficient scale while 
the corresponding marginal or average cost p is the long term limit of the equilibrium 
price in a perfectly competitive market with free entry of firms using the technology 
0. 11@ These concepts are illustrated on Figure 2.3 for some positive fixed cost. 12@ 

Hotelling's lemma 

Differentiating n and using the envelope theorem, we obtain Hotelling's lemma: s{p) = 
IT {p) stating that the firm's behavior reveals her preferences; indeed, if we are able to 
follow the evolution of profits with the evolution of the market price, then we can recover 
the competitive supply function which is to say, the marginal cost function that tells us 
almost everything about the firm's technology. 

It is sometimes convenient to skip the construction of the cost function and derive 
the demand for inputs and the supply of output directly from the profit function n = 
pQ>{K,L) -rK- wL-F. The prices p, r and w are exogenous parameters while the inputs 
K and L are the endogenous choices to be made; the FOCs <&L=wlp and O^ = rip yield a 
unique pair [K*,L*) function of the relative prices w/p and rip. The competitive supply 
is then simply <&{K*,L*). 

Applied to the Cobb-Douglas technology q = K a lP , we first use the marginal prin- 
ciple f = RMST K i L = % to obtain f = ® L {K,L) = ^I^ 1 and jgf — ) L a lP~ x = — o 
r ul v \ f>r } p 

a ac^-a w a-l r -a p = L l-a-f} 

L* = {a a p l - a w a - 1 r- a p) T ^ 



aw 



K ^^-L* = {a 1 ^^w-^- 1 P y- a - 



Sip, w, r) = ®{K*,L*) = (a a p fi w- fi r- a p a+fi ) 1_ "- 



2.1.5 Miscellanies 

Factor Demand 

Firms sell output in competitive markets but also buy inputs, the factors of production. 
At the optimal level of production where price equates marginal cost, the marginal pro- 
ductivity of each factor is equal to the ratio of this factor's price to the output price; for 
instance Fi = ^ and Fk = ~. Now, since we generally assume marginal productivity to be 
decreasing, it must be the case that factor demand decreases with the factor price; this 
is indeed the case for the Cobb-Douglas technology seen above. To conclude, the firm, as 
a buyer of factors, behaves like a regular consumer i.e., bid a decreasing quantity-price 
schedule in competitive factor markets like those for labour, capital or raw materials. 

Multi-plant Firm 

The previous issue of grouping or not production into a single plant leads us to consider 
the related matter of the multi-plant firm which is faced with the problem of allocating 
total production (of the same good) among its various production centers to minimize 
overall cost i.e., we are looking for 

Ciq) = min I c\q t ) s.t. q=Lqt (2.7) 

If at least one plant has economies of scale then all production should take place in the 
cheapest one. Otherwise all plants present diseconomies of scale and the solution is 
quite simple: given the actual level qi in each plant i = 1 to n, rename plants to obtain 
C mt iiqi) < C mi 2iq2) ^ ... ^ C m>n iq n ). The optimal way to produce more is to use the first 
plant, hence bit by bit its marginal cost will come to equate that of the second plant. If 
we want to further increase production we will allocate units among plants 1 and 2 to 
maintain C m< \ = C m>2 but both marginal cost will increase to the point where they equate 
C m $. Generalizing this equi-marginal principle (cf §2.1.1), we find that the optimal 
allocation is characterized by equality of al marginal costs. 

C l m iqi) = C 2 m i q2 ) = ... = 0?„) (2.8) 

If all plants are identical with C l (.) = (pi.) then, by symmetry, production is identical 
in all plants so that Ciq) = mpiqlri) < (piq) (since the common technology presents disec- 
onomies of scale). 



Multi-products Firm 

Consider now a firm using factors such as capital and labour in several independent 
plants to produce a variety of commodities labeled j = 1, .., k. The total profit is n = E j<k n j 
where the branch ;' profit is ttj = <&HKj,Lj) - wKj - rLj. 

As we already saw in equation (2.3), maximizing the branch profit tij leads to pj = 
■tj = -y => Pj& L = w- Since the cost of labour, the wage w, is the same in all branches, we 

obtain Pj® J L = Pi® l L for any two branches i and ;'. 

Let us now introduce the marginal rate of transformation from good j to i (MRT) as 

-j-f ; this ratio expresses how much production of good j must be given up to produce one 
additional unit of good i without changing the overall quantity of labour. 

We may now say that the overall profit is maximal only if the chosen quantities of 
factors are such that 

Pj_ __ J_L 

Pj " $i 

for if the MRT is not equal to the price ratio, then there exists an arbitrage opportunity 
that amounts to transfer an employee from one branch to the other. The same marginal 
principle holds for other factors such as capital (cf §2.1.1). Summarizing, we have 

°r Pi °i 

-4 = ^ = ^ (2.9) 

<D[ Pj <D^ 

Notice that if the branches were independent firms, (2.9) would nevertheless hold 
as a consequence of (2.3) because firms adjust their factor purchases through the factor 
prices: Pi® l L = w = Pj® ] L and Pi$> l K =r = Pj® ] K . 

Economies of Scope 

Increasing the variety of goods and services offered for sale need not be extremely costly 
for a firm when it is possible to share components or use the same facilities and person- 
nel to produce several products. An assembly chain in a car factory, an oven in a bakery 
or a train track can all be used to produce different products or services in response to 
evolving demand. There is thus a potentially large saving on capital costs, the downside 
being the need for more experienced workers and a more flexible organization. Ana- 
lytically, the economies of scope occur if C{q\,qi) < C{qi,0) + C(0, qz> where q\ and qz are 
quantities of variants of the same product (produced within the same plant or office). 

Economies of scope are also present in all service industries where demand is sea- 
sonal for a trivial reason: you cannot consume peak and off-peak services at the same 



time, thus the capital goods are never in rivalry to serve demand. It is more economical 
to have a single large hotel on the beach open all year and use only the first floors dur- 
ing winter rather than having the same hotel opened only during summer and a small 
boardinghouse aside opened only during winter. Seasonality is more thoroughly studied 
in §25.1. 

Constant Elasticity of Substitution 

With the Cobb and Douglas (1928) technology, the relation between productivity of labour 
j to relative wage wlp is bound to be linear when labour and product markets are com- 
petitive; indeed, profit maximization yield wlp = Ojr, = a®. Now, econometric studies cov- 
ering production data from many industries and countries have shown an increasing but 
concave relationship. If we write the equation in logarithm (normalizing p to unity) as 

ln- = \na + b\nw + e (2.10) 

L 

then the parameter b is empirically lesser than unity. If the relation is exact (i.e., e = 0), 
then b is the elasticity of labour productivity y = j to wage w (this is a simple conse- 
quence of differentiating (2.10)). Arrow et al. (1961) then impose constant returns to 
scale and look for all the production functions such that the elasticity of substitution is 
constant: they obtain the Constant Elasticity of Substitution technology (cf proof). 

2.2 Demand 

We shall always assume a large number of potential buyers for the good or service under 
consideration. These purchasers can be households at the retail level or firms at the 
wholesale level; each individual demand is the result of the maximization of the rele- 
vant objective, either utility 13@ for end-users or profits for firms. The sum of individual 
demands form an aggregate demand that negatively relates market price and total sales. 
In this section, we first study succinctly the theory of preferences, utility and indi- 
vidual demand. Then, we study the properties of the aggregate demand function and 
define two key concepts for the analysis of markets: revenue for firms and surplus for 
consumers. 



2.2.1 Utility 

Preferences and Utility 

How many chocolate bars would you exchange for an apple? How many apples would you 
give up for a chocolate bar? If you go to an apple auction, how many chocolate bars would 
you bid for an apple? These examples involve the idea of substitution between goods. If 
your answer to the last question is 3, you should answer 1/3 and 3 to the previous ones 
in order to be consistent. The dimensionless number 3 is your rate of substitution from 
chocolate bars to apples while 1/3 is your rate of substitution from apples to chocolate 
bars. Considering smaller quantities (as if apples were divisible into tiny quarters), we 
obtain the central concept of consumer theory, the marginal rate of substitution (MRS). 
Unlike this example seems to suggest, the MRS need not be constant. In most cases, it 
changes as the quantities of both goods hold by the individual change. For instance, if 
you now own more apples than previously, then your MRS from apples to chocolate bars 
should fall from 3 to, say 2. 

Modern economic theory assumes that goods and services can be clearly identified by 
labels j running from 1 to A: and measured by quantities Xj in well specified units so that 
we can meaningfully speak of a basket x= {Xj)j<k of goods (the supermarket analogy is 
useful for illustrations). Next, it is assumed that consumers are empowered with the 
ability to compare all baskets; more specifically, for each individual there exists a binary 
relation > where x > y means "x is preferred to y by the individual". These preferences 
satisfy: 

Completeness either x>yory>xorx~y 
Transitivity if x > y and y > z then x > z 
Monotony if y adds 1 unit of one good to x then y > x 
Convexity if z = ^ then z > x and z > y 

Then, one can demonstrate the existence of a utility function u representing the con- 
sumer's preference in the sense that x>y o u(x) > u{y) which eases the task of working 
with the preferences of consumers. By monotony, the marginal utility of good j is positive 
i.e.,« yS ^ >0 .i4» 

The indifference curve through x is defined as the set of all baskets y indifferent to x 
for the consumer i.e., the solutions in y of the equation w(y) = u(x). If we pick y = x+ dx 
(a small variation), then u(y) = u{x) + uidxt + ujdxj so that to maintain utility constant, 

— dx ■ 

we need = mdxt + Ujdxj => -^- = -^ > 0. Observe now that if we set dxt = +1, the ratio 
of marginal utilities is exactly how much good j the consumer is ready to sacrifice to 



get one more unit of the good i, hence the marginal rate of substitution from j to i is 

It is obvious that the utility concept is a non tangible measure of a consumer felicity 
since any increasing transformation of u gives rise to another acceptable utility function 
(e.g., take v{x) = u(x) 2 ); it thus make no sense at all to say "utility increased twofold" 
as opposed to the same statement relative to income. Because of this multiplicity, one 
could worry about the coherence of our definition of the MRS. Hopefully the latter is 
independent of the choice of the utility function because whether u or v represent the 
preferences of the consumer, we have ^ = ^. 15@ 

Market Choice 

Introducing markets for all goods and services, prices p = [pj)j<k and income w for the 
consumer, we are able to tackle the selection of the optimal basket which will procure 
the greatest satisfaction. Letting p.x = T.jskPj x j denote the cost of basket x at prices p, 
the maximization of utility under the budget restriction amounts to solve 

{max x w(x) 
(2.11) 
s.t. p.x< w 

To solve (2.11), we first observe that the constraint trust be binding at the optimum 
for otherwise the money leftover could be used to buy more of any good and further in- 
crease utility (by the monotony property of preferences). Next, we apply the "manna" 
trick and derive an equi-marginal principle (cf §2.1.1). We divide w into many small 
gifts of one monetary unit. Each gift allows to buy y_ units of good ;' and get y. additional 
utility. In order to successfully apply the manna trick, we must assume decreasing mar- 
ginal utility i.e., Ujj < because marginal utility Uj must decrease when money is used 
to buy good ;' in order to reduce the wedge with other employments. At the optimum, 
when all w has been spent, we have 16@ 

*=!* = =!* (212) 

Pi Pi Pk 

analytically identical to (2.3). Furthermore, this constant is the marginal utility of in- 
come V'{w) because this is exactly how much final utility increases when income rises by 
one unit. 17 ® From (2.12), we see that the consumer equates his RMS to the price ratio, 
hence the equi-marginal principle holds economy wide across all consumers. 

At the solution of (2.11), each quantity of good depends on all prices and the income 
but not on the particular utility function used since only the RMS matter and it is inde- 



pendent of the particular utility function representing the preference of the consumer. 
The money -metric utility or minimum income function, u{x) = minp.z, avoids introducing 

z>x 

arbitrary utility functions, thus enables comparisons among consumers at the cost of 
being based on the current set of prices. 

For the practical resolution of (2.11) (e.g., exercise, numerical estimates), Lagrange 
(1797) adds a new variable A, the multiplier and builds the (now called) Lagrangian 
££(x,X) = u{x) + X{w-p.x). It appears that the FOCs for a saddle point of 5£ are Uj = Xpj for 
j < k and p.x= w i.e., exactly those characterizing the original solution. 18 ® In economic 
applications of the Lagrange method, the multiplier is called the shadow price (of the 
constraint). An interpretation sees an auctioneer coming out of the shadow and offer to 
pay an interest A for the slack money z - w - p.x; the consumer then freely maximizes 
5£ over x. Since this possibility is just a mind experiment, the two approaches coincide 
only if the consumer neither wish to lend nor borrow at his optimum. We thus obtain an 
additional condition on the shadow price, z* = 0. Under this interpretation, the shadow 
price at the optimum appears to be the marginal utility of money. 

Substitution and Complementarity 

We present here an example which also serves as the foundation of Singh and Vives 
(1984)'s oligopoly model studied in §5.2.3. 

Varieties 1 and 2 of a good are sold alongside an aggregate of all other goods called 
money m. The representative consumer utility is U{qi,q 2 ,m) = m + q\ + q 2 -\p x q\-\p 2 q\- 
yqiq 2 . Given his income M, the consumer maximizes U{qi,q 2 ,m) under his budget con- 
straint M<R-p\q\-p 2 q 2 . His demand {q\,q 2 ) satisfies the FOCs over quantities q\ and 

( />i = l-Mi?i-7<fe Vj-T-ViPi+YPj . . io 

i => qi = — 5 fori = 1,2 

I Pz = l-p 2 q 2 -yqi P\\h.~y 

We assume p l p 2 > y 2 to guarantee that demand be decreasing in own-price and \p t \ > \y\ 
to guarantee that demand be positive for zero prices. Summing over all consumers, 
the demand addressed to firm i = 1,2 is Di{pi,pj) = at - bipt + dipj for some coefficients 
at > 0,bi > 0, \bt\> \dt\ and dt the sign of which is given by y if it is the same for all 
consumers. 

Market Demand 

Demand theory, by interrelating the various goods and services is useful to understand 
the process of substitution that a consumer might embark on in response to a change of 
price or of income. That knowledge is crucial for a firm wishing to anticipate the effect 



that raising her price will have on the demand she receives from consumers or for a 
government wanting to know the effect of increasing taxation (for whatever purpose) on 
consumers behavior. The general rule is that a change in price of one good, say cheaper 
gasoline, generates a wealth effect (the purchasing power increases) and a substitution 
effect (gasoline consumption increases because it is relatively cheaper than substitute 
energies or transportation modes). 

We shall often isolate the relationship between the quantity xj of a good and its price 
Pj, thereby obtaining our workhorse, the individual demand curve which is decreas- 
ing. 19@ Generally, we treat changes in other prices or income as shifts or movements of 
the entire demand curve. A good is complementary (resp. substitute) to another if in- 
creasing the price of one causes the demand for the other to fall (resp. increase). A good 
is deemed inferior if the curve goes down when income increase; among normal goods 
(i.e., not inferior), luxury goods are those for which total spending increases faster that 
the buyer's income. 20 ® 

Although the buying power of l€is independent of wealth, its "satisfying" power is 
not. Once we get to satisfy our basic needs, we look forward to consume more onerous 
goods and services. Thus, the richer we are, the less valuable is a 1€ coin. This observa- 
tion which dates back to Aristotle is a major assumption of economic theory: economic 
agents display decreasing marginal utility of wealth. The relation with the previous the- 
ory of rational consumption is that holding prices constant, the indirect utility derived 
from the optimal basket is a function v{w) of his income which satisfies v < < v . 

As further reading, the website of the Association for Consumer Research provides 
entertaining studies such as: 

• how we build our preferences 

• why smoking has the status of a forbidden fruit for teens 

• whether time is money for everyone 

• why do we spend so much on weddings 

2.2.2 Willingness to pay 

Dupuit (1844)'s concept of willingness to pay (WTP) plays a central role in this book and 
thus warrants some space. We introduce the idea using the original example of this 
author and then relate it to the demand. 

Imagine that continuous improvement in technology enable to reduce the price at 
which water is supplied to a city; inhabitants will start to consume more and more water 
for less and less pressing uses as shown in Table 2.1. From this observed behavior, we 
deduce the resident's willingness to pay for additional cubic meters of water. We can 



therefore attribute a personal and monetary value to each cubic meter consumed by the 
resident noticing that each additional unit is devoted to satisfy an evermore futile need. 



Price 


Qty 


Marginal use 


WTP 


5€/m 3 


lm 3 


drinking, washing 


> 5€/m 3 


3€/m 3 


4 m 3 


weekly car wash 


3 to 5€/m 3 


2€/m 3 


10 m 3 


daily garden watering 


2 to 3€/m 3 


l€/m 3 


20 m 3 


running water fountain 


1 to 2€/m 3 



Table 2.1: Water Use 



These observations confirm Aristotle's claim (expressed in modern words) that margi- 
nal utility diminishes with consumption. As a direct consequence, the demand for water 
of the resident increases as the unit price diminishes. Notice however that the demand 
for a free good will never be infinite because there is an opportunity cost of consuming 
it. Indeed, there always exists alternative and competing ways to use our time, so that 
instead of spending the whole day watering his lawn, the resident will use some time to 
mow. These simple deductions hold for any good or service. 

To understand the aggregation of individual demands, we recur to a new example. 
The left panel of Figure 2.4 represents the monthly demand schedule for movie rentals 
of a consumer, say Julie. If the price last month p was such that she rented 5 movies, it 
must have been the case that her willingness to pay for a sixth one, p§, was lesser than 
p for otherwise she would have rented that sixth unit; her pattern of behavior also tells 
us that her willingness to pay for the fifth one ps was greater than p. 2l@ If the price this 
month rises a bit above p 5 , she reduces his consumption down to 4 rentals because the 
fifth one is not worthwhile anymore to her. By moving up and down the price, we see 
that her demand curve is also a decreasing willingness to pay curve. 

Summing the individual demand schedules of all consumers for that market, we ob- 
tain a smooth decreasing relationship between quantity and price Dip) called the market 
demand as shown on the right panel of Figure 2.4; the only change from a graphical point 
of view is that the scale of the horizontal axis is now in thousands of monthly rentals. 22 ® 
The inverse of the demand function is the decreasing price function P{q) which exactly 
measures the willingness to pay for an additional unit of the individual who bought the 
last unit (#q). For that reason, the inverse demand is rightfully called a (market) willing- 
ness to pay function. All our examples in this book use a linear specification for demand 
with 



D(p) = a-bpoP{q) 



a- q 



(2.13) 



Parameter a is a proxy for the market size since it measures exactly how many units 
would be consumed if the good was free. Parameter b then measures the reactiveness of 
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Figure 2.4: Demand Schedule for Movie Rentals 



demand to changes in prices. 23 ® 

An important concept for practical purposes is the elasticity of demand with respect 
to price which is the percentile variation of demand in response to a percentile variation 

of price; formally, 

p P{q) 



ADiAp 
D p 



= \D'{p)\ 



(2.14) 



Dip) q\P'iq) 

The consumer demand is said to be elastic if e > 1, isoelastic if e = 1 and inelastic other- 
wise. Be very careful that the elasticity varies along the demand curve. In our linear 
example, we have e 
inelastic otherwise. 



bp 
a-bp 



^-^, so that the demand is elastic meanwhile q < f and 



2.2.3 Market Revenue 

The total revenue of sellers is by definition the total expenditure of buyers and is called 
the market revenue. Since sales depends on price, so does revenue; the useful idea of 
Cournot (1838) is to relate market revenue with sales by defining R{q) = qP{q). The 
marginal revenue is the additional revenue accruing to sellers when an additional unit 
is sold. Formally, it is the derivative of revenue with respect to sales; it satisfies 



R m iq) = P(q) + qP'iq) = (l - ±) P(q) < P(q) 



(2.15) 



This concept, complementary to that of marginal cost, will prove crucial to study the 
performance of a market independently of the number of firms; it makes clear that the 
revenue of an industry is not a free variable, it depends on the consumers and their 
preferences for this good versus the alternatives or substitutes they can have for their 



money (in case they refrain from buying that one). 

2.2.4 Consumer Surplus 

Unit Surplus 

When looking at a product in a store, we can always ask ourselves how much we would 
be ready to pay to get it. If this subjective price, say 10€, is greater than the objective 
(market) price, say 8€, then we would be right to actually buy the item; in doing so we 
would derive a surplus of 2€ 24@ . The force of this simple comparison is to measure in 
monetary terms something that is difficult to gauge: the excess of satisfaction we derive 
from consuming the item instead of using the 10€ to buy a worse alternative. 

Dupuit (1844) offers with the concept of consumer surplus to decompose an individ- 
ual consumption into a sum of marginal choices. 25@ Consider Julie's demand for movie 
rentals displayed on the left panel of Figure 2.4. Applying the previous idea, we can say 
that her surplus of the first rental is p\ - p, that of the second rental is pz - p and so 
on until the last rented unit for which the surplus is almost zero (because p$ and p are 
close). 

Individual Surplus 

Julie's consumer surplus is then simply defined as the sum of unitary surpluses for all 
the units she consumes: 

5 

{pi ~p) + {p 2 ~P) + {P3 ~P) + (P4 -P) + {P5~P)= E iPj{X) - p) 

x=l 

where Pj is Julie's WTP function. When the good is infinitely divisible, the (individual) 
consumer surplus of consumer ;' is 

rDjip) />+oo 

c Sj (p)= (Pj(x)-p)dx= Dj(y)dy (2.16) 

Jo J p 

The reader can check the validity of (2.16) on Figure 2.4 using vertical bars and hori- 
zontal ones for integration. The raw surplus summing the WTP of all units consumed 
is called utility or TWTP (total WTP); 26@ it is, by construction, independent of the price. 
The integral form is 

Uj{q)= I P,{x)dx (2.17) 

Jo 



Consumer Surplus 

The (aggregate) consumer surplus at price p is simply the sum of all individual surpluses 
and since market demand sums individual demands (D = E;<fc-Dy), we obtain 27 ® 

W D (p)=j: csj(p)= D(y)dy (2.18) 

with W' D {p) = D{p) i.e, margin surplus is demand. 

This seemingly trivial step hides an important normative choice: interpersonal utility 
comparison. By adopting a utilitarian concept, we are implicitly assuming that l€ gives 
the same satisfaction to everyone, which is clearly not the case when comparing a bour- 
geois and a pauper. Marshall (1890) argues however that this neglect of income and 
substitution effects is acceptable for goods and services that represent a small share of 
an individual's expenses (cf Willig (1976) for an exact derivation and Binmore (2007) for 
a discussion). 28 ® Notice that whenever the item under consideration is an intermediate 
good like "crude oil" or a service like "office cleaning", its buyers are firms and their con- 
sumer's surplus is in fact a monetary profit so that the previous critic has absolutely no 
bite. 

One could also worry that the calculus of W D (whose notation shall become clear in 
§2.3.2) depends very much on the shape of demand at very low quantities, an information 
almost impossible to obtain. This is irrelevant because the practical use of the concept 
involves comparison of two nearby situations, hence depends only on the slope of demand 
around the current point of consumption. 

Finally, we use the duality of demand and willingness to pay, q = D{p) o p = P{q) to 
express the consumer surplus as a function of the total quantity absorbed by the market, 

W D {q)=\ D(y)dy= (P(x) - P(q))dx (2.19) 

JPiq) JO 

Observing 29 ® that W^{q) = -qP'{q), we can rewrite equation (2.15) in a way that will 
help us understand the issue of market power in the next chapter: 

R m {q) = P{q)-W' D {q) (2.20) 

Equation (2.20) also tells us how the price of an additional unit (P) is divided among 
sellers (R m ) and buyers (W' D ); it makes clear the impossibility for sellers to appropriate 
themselves the entire willingness to pay of buyers if the transaction takes places in a 
market (i.e., all units are sold at the same price). 

In the linear demand example, we obtain W D {p) = (fl ~ fa p) or W D {q) = |^ (check this 



formula using the right panel of Figure 2.4). 

2.3 Equilibrium 

This section, inspired by Marshall (1890), 30@ considers the market for a single good or 
service. We shall derive the aggregate supply and match it with the aggregate demand to 
deduce the equilibrium and its properties. Then we relate these findings to the theories 
of general equilibrium and welfare in order to infer the efficiency tool that will be central 
in this book: the market welfare. 

2.3.1 Producers Surplus 

We saw previously that the individual competitive supply of a firm with decreasing re- 
turns to scale 31 ® is an increasing function of the product's price. When firms i = l,..,n 
are active in this market, their aggregate competitive supply is Sip) = £;•<„ $iip) where St 
denotes the supply function of firm i. If, for example, all firms share the same technol- 
ogy with decreasing returns to scale characterized by the cost function C l iq) = ^-, then 
the common marginal cost is C l m iq) = cq so that the common supply function is Slip) = £; 
lastly, we deduce the aggregate supply Sip) = ^r. 

More generally, the aggregate supply curve S is increasing and its inverse, denoted 
C m , can be rightly called the marginal cost curve of the industry. Indeed, if all firms were 
subsidiaries of a single (monopolistic) firm, her marginal cost would be exactly C m . 32@ 
The producers surplus which sums the surpluses of all producers can now be precisely 
defined as the sum of the marginal profits made on each sold unit 



,q)= [ {p-C m ix))dx (2.21) 

Jo 



W s ip 

When the quantity is the aggregate competitive supply q = Sip), the producers surplus 
is given by the area between the horizontal price line and the industry supply curve 
from zero to the quantity supplied at that price. It should be noticed that the consumer 
surplus depends on either p or q but not both since the point ip, q) belongs to the demand 
curve; this is so because consumers have no market power. The producer surplus, on 
the other hand, depends on two free variables because producers may exercise market 
power which means that the point ip, q) need not pertain to the competitive supply curve. 
Compare for instance the pairs ip\,q) and ip2,q) on the left panel of Figure 2.5 below. 



2.3.2 Equilibrium and Welfare 

The equilibrium price p* is reached when supply equals demand as can be seen on the 
right panel of Figure 2.5 where the competitive demand and supply schedules are plot- 
ted. It is readily observed that the equilibrium occurs for the quantity q* that equates 
the willingness to pay P{q) of consumers with the industry marginal cost C m {q) = S^(q). 
In the example where D{p) = a-bp and S{p) = ^-, it is a simple exercise to find p* 



bc+n' 



the total quantity produced is q* 




£^ and the individual one is q\ 

AP 



bc+n 



for k < n. 




Figure 2.5: Competitive Equilibrium 

As we explain in the next subsection, the key concept for the normative analysis in 
this book is that of total surplus or welfare which adds the surpluses of consumers (2.19) 
and producers (2.21): 



W(p,q) = W D (q) + W s (p,q) 

(P(x)-C m (x))dx if q = D(p) 
Jo 



(2.22) 



At the competitive equilibrium {p*,q*), welfare reaches its maximum denoted W* = 
W{p*,q*). Indeed, whenever the price is p\ or p 2 different from p* , the traded quantity 
q is the minimum of demand and supply which generates a welfare loss shown by the 
hatched triangle on the left panel of Figure 2.5. In equilibrium, the distribution of wel- 
fare between consumers and producers is given by the equilibrium price, the large upper 
triangle is Wd while the large lower triangle is W§ (cf. right panel of Figure 2.5). In our 
linear example, the welfare at the equilibrium can be computed and simplifies into 



W* = W D (q*) + n{q*p*-Ci(q*)) = 



1 na 2 



(2.23) 



2b n + be 
If the industry marginal cost is constant (flat supply curve), a price rise Ap above 



the efficient level p*, due for instance to a tax, will reduce the quantity exchanged by 
Ag = -b/S.p so that a welfare loss of ^ApAg = ^(Ap) 2 is generated. 33@ 

The competitive price p* implements an allocative efficiency i.e., the "right" number 
of units is produced, sold and consumed. We can also take the longer view and wonder 
how many firms ought to participate in this market i.e., seek dynamicefficiency. The 
answer is not "as many as possible" because some scarce resources are needed to set up 
a firm, there is a fixed cost. The efficient number of firms is then that which maximizes 
the long term welfare net of entry cost W* - nF i.e., one has to balance the fixed cost of 
introducing an additional copy of the technology with the welfare gain derived from its 
use by lowering the industry supply curve. This issue is studied in §6.1 on the dynamic 
of entry and exit. 

2.3.3 Welfare Economics 

As we already noted, our normative approach is rooted in the utilitarian concept of con- 
sumer surplus. Our main instrument to assess allocative efficiency and judge among 
alternative outcomes is market welfare W, the contribution of the market under study 
to national income (GDP). This criteria for guiding social choice is not innocuous since it 
justifies taking away everything from someone if he is blocking the progress of society, 
whether he is rich or poor. This is quite the opposite of the rawlsian approach aiming at 
benefitting first the person with the lowest level of satisfaction in society (cf Dasgupta 
(2007) for a discussion). 

A more technical issue is the fact that W is a partial equilibrium concept. We thus 
need to relate it to the findings of general equilibrium theory in order to assert its apt- 
ness. Indeed, it could be the case that restoring efficiency in one market through reg- 
ulation, taxes or breaking-up a monopoly creates worse distortions in related markets. 
The second best theory shows formally such a possibility; however, this fear has never 
materialized itself beyond contrived general equilibrium models with no bearing upon 
reality 34 ® In practice, when two markets are strongly interdependent (e.g., car maker 
and car electronics), the partial equilibrium analysis can be augmented to account for 
their relationship; total (market) welfare then sums those of the two markets. We may 
therefore synthesize our normative approach as follows: 35 ® 

The received paradigm of Industrial Organization states that accurately defining the 
relevant market and solving its inefficiencies increase the welfare possibilities of the 
entire economy, given the current redistribution mechanisms at work. 



2.4 Game Theory 

Firms often collaborate to create value but then compete to divide it up. Game theory, 
using mathematical formalism, tackles these interactions known as cooperative and non 
cooperative (Aumann (1985) offers a panorama on the aims and method). We briefly 
present the equilibrium concepts that enable the analysis of simultaneous interactions; 
simultaneity here means that an economic agent has no information relative to the ac- 
tion of his challengers, only guesses. We then move on to concepts related to sequential 
interactions, the main novelty being that each decision maker is able to observe some or 
all of the actions taken by those who played before himself. Holding such an information 
can be advantageous for me, as a player, because I am able to make "better informed" 
choices. Yet, as we shall see, it can also be a limitation because everybody knows what 
I know myself, thus everybody can guess what my course of action will be, as if they all 
possessed more information. The last section deals with cooperative settings, negotia- 
tions, bargaining and problems of wealth division. 

2.4.1 Simultaneous Games 

Reduction 

A game can involve any number of players 36@ but since interactions in 10 are typically 
bilateral, we shall content ourselves with two players. Furthermore, we reduce the inter- 
actions to the bare minimum of a binary choice. The graphical representation displayed 
on the left panel of Figure 2.6 leads to speak of the row (R) and column (C) players 
whose strategies are respectively north (N) or south (S) and east (E) or west (W). The 
geographic labels have obviously no meaning and serve to underline the power of ab- 
straction of game theory. Each player holds preferences over the four possible outcomes 
{NE,NW,SE,SW} synthesized by four utility levels or payoffs. A 2 x 2 matrix game is thus 
characterized by 8 figures. If the gain of one party is exactly the loss of the other, the 
game is zero-sum. When parties share a prize (it may have negative value as in war), 
the game is constant-sum. 
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Figure 2.6: Game in Normal Form 



We can now apply Stinchcombe (2007)'s powerful simplification to classify 2x2 matrix 
games. Whatever the equilibrium concept, it appears that the only thing that matters is 



the utility differential contingent on the opponent's action. Thus, adding a constant con- 
tingent on the opponent's action, is neutral for the equilibrium analysis. We may then 
zero the out-of-diagonal payoffs for both players. The resulting payoff matrix, synthe- 
sized by just 4 figures {a,b, c,d}, is shown on the right panel of Figure 2.6: the first entry 
is the row player's payoff while the second entry is the column player's payoff. When 
the interaction of the players is worthy of analysis, each is sensitive to his opponent's 
action, hence none of the 4 figures is zero. This means that figures are either positive or 
negative, so that we have 2 4 = 16 different games. 

Strategic Classification 

The analysis of strategic interaction displayed on Figure 2.7 uses arrows indicating the 
desire of players to change their action, conditional on the opponent's action. A © occurs 
as soon as one player would prefer to leave the entry while a © indicates a stable outcome 
known as a Nash equilibrium. When the two arrows of a player go in the same direction, 
they indicate a dominant action i.e., one action pays more than the other independently 
of what the opponent does (the column or row is crosshatched). 37@ This situation occurs 
each time the payoffs for one player have opposite signs. Checking out the possible 
combinations, we have 38@ 

O Both players have a dominant strategy; we may erase the dominated row or column 
for each. One entry remains. It is a dominant strategy equilibrium. 

© One player has a dominant strategy; we may erase the dominated row or column 
leaving one player choosing his preferred action. There is a unique stable outcome. 

© Multiple equilibria either in the diagonal or off it. The corresponding issues are, 
respectively, coordination and leadership. 

© No stable outcome as optimal choices cycle clockwise or counter-clockwise. 
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Figure 2.7: Games in Normal Form 



cycle 



Economic Classification 

We now analyze the previous cases from an economic point of view using convenient 
payoffs to convey better intuitions. 



® Prisoner Dilemma The following situation is typical of life in societies (biology, 
sociology, psychology and economics). Two parties would like to agree on some course of 
mutually beneficial action but each sees that particular action as a dominated one, thus 
plays the other one, triggering a bad outcome. The agreement cannot be carried because 
there is no way to enforce it. 
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Stand-up trap 



Prisoner Dilemma 



Advertising 



If we all sit or stand, we watch the show equally well, the only difference being 
exhaustion (2 > 1). If one rises when others sit, he sees much better (4) at the 
expense of who sits behind (0). Standing is the dominant action for both. 
The light Hawk-Dove game of biology uses the same matrix: a resource of value 
4 can be shared peacefully (Dove attitude = Sit) or contested (Hawkish attitude = 
Stand). If both contest, half of the value is destroyed. Hawkish attitude is domi- 
nant. Notice that if too much is destroyed, then we have a heavy version which is a 
game of leadership (cf ®). 

Prisoner Dilemma (Original version): payoffs are negative since players evaluate 
jail terms. Denial by both suspects entails a smaller jail term (2 years) than mutual 
confession (5 years). Yet, rating on the other (lone confession) is rewarded by free- 
dom for the informant and by charging the betrayed accomplice with the 10 years 
of jail. Betrayal (rating) is the dominant action. 

Prisoner Dilemma (Business version): pure advertising. The first to advertise gains 
market share at the expense of his challenger. Then, the loser starts advertising 
too in order to regain his clients. Once all advertise, market shares return at their 
initial level but advertising expenses have been incurred, so all lose. 
Trust in commerce: when parties are not face to face, it is a dominant strategy for 
each to renege on its promise which is to send the good for the seller and to send the 
payment money for the buyer. If so there is no trade which is why trust building is 
so important. 



© Second Mover Advantage This is the least known case, probably because it is the 
least frequent one. In some situations involving weak W and strong S players, the high- 
est earner can be the weakest as shown by Markides and Geroski (2004). The reference 
situation has payoffs normalized to zero. Investment into a copiable innovation costs 2 
and generates a market value of 8. When both firms invest, S grabs 6 out of 8 so that pay- 
offs are 6-2 = 4 and 2-2 = for W. When a single firm invests, she is left alone polishing 
the new product and supporting the cost of its final development while her competitor 
is then able to copy for free the final design. This means that the distribution of the 
market value 8 is tilted against the investor by one unit. If only the weaker firm invests, 
she gets 1 and earns a net 1-2 = -1; S earns 7. If only S invests, she gets 5 and earn 
5-2 = 3 while S earns 3. Waiting thus appears to be a dominant strategy for the weaker 
player. In practical terms, W can successfully convince S that he will never invest, thus 
forcing him to go alone. The key to the result is the -1 entry i.e., that investing only to 
be overtaken by the strong follower is not worthwhile for the weak player. 
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© Coordination In games of coordination, there are several equilibria. 39@ 
wrt. how players rank these outcomes. 
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• Rousseau (1755)'s Stag Hunt illustratescoordination issues in society: a lonely 
hunter can catch a hare but it takes two hunters to catch a valuable stag (deer). 
If one is going to hunt hare, it is pointless for the other one to try to catch a stag. 
There are two equilibria and both players rank them likewise; yet if there is a 
history of hare hunt, stag hunt is unlikely to emerge. 

• Bank run (macroeconomic version): I keep my deposit in the bank if everybody does 
so. Yet, I rush to take it out once I become convinced that everyone will do the same. 
There are two equilibria, chaos and stability, that savers rank alike. 

• Insurance or Mutualization: a risk, unlikely to hit all members of a community 
at the same time, can be eliminated if all contribute their actuarially fair share of 



the expected cost. Yet, there is another equilibrium where none contributes i.e., a 
coordination failure. 

• Battle of the Sexes: Spouses prefer to spend their evening together. Joint activities 
are the only (pure strategy) equilibria but since each partner has an ideal activity, 
they prefer different equilibria. 

• Joint venture: in this business equivalent, firm join forces to develop a product 
around a core but each would like to impose his original technology as the core. 

© Leadership Although strategically identical to games of coordinations, games of 
leadership display asymmetric behavior in equilibrium. There are two stable outcomes, 
inversely ranked by the participants so that again, history is likely to decide who gets to 
pick the equilibrium. 
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• The Chicken game illustrates political science and economics; outcomes are ranked 
as follow: the best is to be strong against a weak opponent (3), then both weak (2) 
because it is a sage attitude where no one looses face, followed by being weak in 
front of a strong one (1) as one is ridiculed (called a "chicken"). Finally, mutual 
fighting has dire consequences (0). As noted above, the Hawk-Dove game with 
destruction outstripping initial value is a game of chicken. 

• The Snowdrift game illustrates public good finance: both parties should remove the 
snow but it is always tempting to free ride on a volunteer. However, the action is 
still beneficial for its author (as opposed to ®), so that two asymmetric equilibria 
exist. 

• In the Local Monopoly game, firms can either produce ham or cheese flavored crack- 
ers. If they produce the same kind, they fiercely compete "a la Bertrand" (cf. §5.2.1) 
and earn zero profit whereas if they differentiate their crackers, each enjoys mar- 
ket power and profits (cf. §5.2.2). Still, consumers favor cheese-flavored crackers so 
that the producer of this particular type earns more (2 vs. 1). 



© Cycling These games of strategic interaction have no pure stable outcome since 
there is always a player willing to change his action given what the other is about to do. 
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• In the Matching Pennies, players show the Head or Tail of a coin; if the faces agrees, 
R pays l€to C, otherwise C pays R. Rock, Paper, Scissors is an extension to more 
strategies. 

• Auditing between the IRS and a tax payer involves a cycle: if I declare truthfully 
my income, the IRS has no point in auditing me, thus I'd better start evading taxes 
but then the IRS should audit me systematically at which point i'd better abide to 
avoid punitive damages. 

• Alternative settings: in the workplace between a controller (potential auditor) and 
an employee (potential shirker), safety regulation between firms and the inspector. 
If the cost of behaving is too high wrt. the expected penalty from being caught then 
shirking becomes a dominant strategy and we are back to ® (the outcome depends 
on the auditor incentives in front of a known infraction). 

• Imitator vs. Innovator: if firms choose the same product (C imitates R), they enter 
a Bertrand competition that C wins thanks, say, to its cost advantage in production. 
If on the other hand, firms choose different products then R's innovative approach 
to branding enables him to win the entire market. The idea here is that each firm 
has a specific know-how and would like to turn it into a market advantage but this 
outcome depends on the behavior of the competitor. 

The Nash equilibrium (cf below) of cycling games sees each player mixing between 
his two actions in the proportion that makes the other player indifferent between his 
own two actions. Using the reduced form with payoff parameters a,b,c,d, the conditions 

are 

a s a+{l-a s )0 = a s + {l-a s )c \ _ [ a s = -^- c 

a R b+{l-a R )0 = a R + {l-a R )d J [ a R = ^ 
which are well defined here given that a and c have the same sign (likewise for b and d). 



General Theory 

A game in "strategic form" r consists of a set of players i = 1,..., n, a set of strategies Si for 
each player i, a function g that maps a full profile of strategies s = {st)i< n into an outcome 
d = g{s) and finally a utility function m for each player i that is defined over the set of 



possible outcomes D. Notice that we may consider the payoff function m(s) = in[g[s)) in 
order to analyze the game. We denote s_ ; = ($/)_/# the vector s without the i th coordinate 
St, hence for all i < n, we always have s = (s,-, S-t). 

In a game r, each player is rational, thus looks forward to choose the strategy that 
gives him the largest payoff. An optimal strategy for player i also known as a best 
response depends on the vector s_ ; of strategies that other players are using; a best 
response is a particular strategy it e S; that maximizes his payoff jii{Si,S- {] when varying 
si overall possibilities from Si. A best response need not be unique. The short notation 40 ® 
is BRi(s-i) = argm&x Si ni{Si, s_ ; ). Thus, to pick his strategy, a player must account for what 
others are doing themselves i.e., he has to form a belief about the others' actions. We 
assume that each player knows his opponents will behave rationally, like him. 

We can now define a Nash Equilibrium as a profile of strategies s* such that no player 
i can do better by choosing an action s ; different from s* , given that every other player j 



adheres to s*. In formulas: 



Vz'< n,Vsi£Si, ni[Si,s_i)<7ii[s*) 



Alternatively, a Nash equilibrium is a fixed point of the best response correspondences 
i.e., for all i, s* belongs to BR t {s^ .). The Nash equilibrium epitomizes a stable "social 
norm": if everyone else adheres to it, nobody wishes to deviate from it unilaterally. 

A strategy for each player is a complete plan that specifies a move at each stage where 
he is active, as a function of his information at that stage. A rational player can choose 
his strategy before the game begins, with no loss of generality. Indeed, a strategy lets 
him specify a different move for every situation in which he might find himself during 
the game. In other words, he need not wait to observe the consequences of other players' 
decisions to decide on his own in response, he can anticipate every single possibility 
and write down in his plan, what he would do in each conceivable case. Hence, each 
player designs his strategy without being informed of the other players' strategy choices. 
The concept of Nash Equilibrium therefore presumes that all players remit an envelope 
containing their strategy (the master plan) to an referee who will open them and apply 
the instructions they contain to perform a play of the game. 

Regarding existence, there are often more than one Nash equilibrium which is prob- 
lematic for economic interpretation as we have no clue on which one to pick. In some 
case, as the matching pennies, there is no Nash equilibrium and we must introduce 
mixed strategies (i.e., enlarge the game) to find an equilibrium. The idea is that players 
use a statistical distribution over the set of strategies S. It has been shown by Harsanyi 
(1973) that this is akin to pick a strategy with certainty in a game where one lacks 



information regarding the payoff of others. 

2.4.2 Sequential Games 

There are many instances of economic interaction where agents act in a sequential man- 
ner. For instance, one firm enters a market and later on, another can do likewise, or 
pass. The method we shall use to study these interactions and find the Nash equilib- 
rium is called the backward induction. We explain the idea using the example of the two 
farmers. 

Each farmer requires his neighbor's help to harvest his field when the time comes, 
or else half will rot in the field. Since the south-looking field ripens earlier, cooperation 
is technically feasible: N should first help S and later on S should help N. The problem 
lies in the incentive to do that. Plato already recognized that S has nothing to gain from 
helping N when it's his turn to help; he would be better of sparing himself the hard labor 
of a second harvest. Hobbes (1651) went further to suggest that S should not help N 
because it is irrational to honor an agreement made with someone else who has already 
fulfilled his part of the agreement. Of course, if this analysis is correct, then would not 
N anticipate this "double crossing" and act accordingly? This is precisely Hume (1740 )'s 
analysis: since N cannot expect S to return his aid later on, he will not help S in the first 
place when S's corn ripens first, and of course S will not help N when N's corn ripens 
later. 

This story can be modeled as game which is represented in extensive form on the left 
panel of Figure 2.8. The payoffs entries are [tin^s)- The value of a fully harvested field 
is 80; it obtains only if the other farmer helps while if the other cheats, half of the crop 
is lost. To these profits, one must possibly retract 3 which is the value of time and sweat 
lost in harvesting someone else's crop. The selfish solution we previously identified is 
easily understood with the help of the tree representation. We start by looking at the 
last decision to be taken before the end of the game when all decisions are translated 
into consequences and final payoffs. After having benefited from N's help (left side), S 
prefers to cheat to earn 80 rather than 77. Even if he has not benefited form N's help 
(right side), he still prefers to cheat to earn 40 instead of 37. We now move backward 
at the moment where N must decide whether to help S or not. We are assured that N 
knows what S will do because everybody is rational and everybody knows that everybody 
is rational. Hence, N knows for sure that his decision will either lead to the second or 
fourth outcome, hence cheating is also the optimal strategy for him. 

The backward induction enables to identify strategies for both players which form 
a Nash equilibrium of the game which is called subgame perfect (after Selten (1975)) 
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Figure 2.8: Games in Sequential Form 

because it rules out non credible strategies at any node of the tree. 

To clarify this latter idea, consider the competition for cocktail crackers among two 
firms, Wallace (W) and Gromit (G) displayed on the right panel of Figure 2.8. W can 
commit to produce either vinegar (v) or cheese (c) flavored crackers. Then, G acting in 
full knowledge of VK's decision, picks a flavor to produce. If they produce the same flavor 
they fiercely compete and earn a nil benefit while if they choose different flavors, each one 
enjoys market power and derives handsome profits. Still it is well known that cheese- 
flavored crackers are the favorite of consumers so that the producer of this particular 
type earns more (6 vs. 4). The backward induction method remains simple to apply: being 
rational, Gromit will choose v after c and c after v. Knowing that Gromit is rational, 
Wallace can anticipate this course of action and therefore reduce the game tree to the 
second and fourth outcomes; it is then evident that c yields a higher final profit. 

There exists another Nash equilibrium where Gromit commits to "always c" while 
Wallace plays v. Yet this equilibrium is imperfect because Gromit is making a non- 
credible threat when pretending that after observing the choice c by Wallace he would 
still stick to c which we all know would be foolish. The backward induction method 
therefore selects the most natural equilibrium among all possible Nash equilibria. 

2.4.3 Negotiation: Sharing & Bargaining 

One of Smith (1776)'s claim, put in modern language, was that "no big bills are ever 
left on the sidewalk" i.e., opportunities for value creation tend to be identified and real- 
ized. Yet, this outcome, conducive of allocative and even dynamic efficiency, requires a 
negotiation of the stakes among the parties involved. 

This section deals with the sharing of a valuable item and follows the original insights 
of Edgeworth (1881)'s theory of exchange. We begin with a string of examples involving 
two parties which corresponds to the situation most frequently encountered in this book. 



We study the division issue first from a normative point of view i.e., how should it be 
done? Later on, bargaining will approach the problem from a descriptive point of view 
i.e., how shall it be done? In the last section, we show how the presence of asymmetric 
information reduces the efficiency of negotiation i.e., parties sometime fail to reach an 
agreement although it would be socially desirable. 

Bilateral Monopoly 

In a perfectly competitive market, welfare is maximized and distributed among partici- 
pants through the price mechanism. Industrial organization on the other hand studies 
mostly imperfectly competitive situations where actors are free to negotiate over many 
issues such as price, quantity or quality; this happens whenever they identify an oppor- 
tunity for cooperation i.e., an opportunity to take coordinated actions that increase their 
joint profit. Examples are: 

• price to be paid by a city to a builder for delivering a bridge. 

• negotiation between a union and the owner of a factory over wage schemes, sched- 
ules or working conditions. 

• negotiation between the EU and a candidate country for accession. 

• Barter of wheat against milk between a grower and a farmer. 

• bargaining between this book's author and his publisher to set the retail price and 
the royalties. 

• dispute between a polluter and his victim regarding abatement. 

• negotiation between the maker of a component and the prospective buyer of this 
specific item. 

One speaks of a bilateral monopoly because each party has the monopoly or exclusive 
ownership of one good or service that is of interest to the other party. Obviously, the 
value created by engaging into such a form of cooperation has to be shared and this is 
the main point of contention. 

To introduce the bilateral monopoly in a more concrete manner, consider farmers 
whose plots of land are face-to-face in a valley where runs a river, one plot looking south 
S, the other looking north N. If there is no bridge to cross the river, both farmers are 
forced to live in autarky. When the harvest time comes, each lonely farmer hasten to 
reap as much as he can but he nevertheless loses one half of the crop. Assuming that 
the southward oriented land is more productive, farmer S's income e.g., ns = 60, will be 
greater than farmer AT's e.g., tin = 40. Industry surplus is n = ns + n^ = 100. 



If the gentle lord of the valley build a bridge over the river, new opportunities arise. 
Farmers can take advantage of the fact that the southward oriented crop ripens first to 
join forces and make a full harvest in each field in turn, thereby avoiding any loss. This 
way, the total value of the crops is doubled to n* = 200. It remains for the farmers to sign 
an agreement or follow an ancient custom to divide the additional income 6 = n* -n= 100, 
generated by the introduction of the bridge (technical progress). The intuitive solution of 
equal sharing is also the most likely whether we take a cooperative or strategic approach 
to the issue. 



Cooperative Approach 

Let us look for properties that a cooperative or normative solution should satisfy with 
the help of Figure 2.9. Firstly, according to individual rationality (IR), a party will never 
agree to an allocation giving him less than what he gets in case of disagreement. The 
latter is an amount of money that can be interpreted as an opportunity cost of participat- 
ing to an agreement, in other words, what he would be able to earn if he were to perform 
his most rewarding alternative activity. In the farming example, opportunity cost are 
its = 60 and jin = 40. This particular outcome is called the disagreement or threat point. 
Graphically, the postulate says that the final outcome will be inside the triangle with 
origin at the disagreement point. We illustrate this with two examples, a bold dot and a 
black square corresponding to initial surpluses n and n'. 
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Figure 2.9: Pareto improving Bargaining 
A second natural postulate is Pareto efficiency or joint rationality: parties are able to 



identify the outcome maximizing joint profit and choose to carry it on; hence the solution 
is a Pareto optimum. 41 ® This axiom amounts to assume that renegotiation of any agree- 
ment is possible and costless; hence if the current agreement does not maximize joint 
profit, the parties enter a renegotiation round to replace it by a better one. This process 
lasts up to the moment where there is no alternative jointly preferred to the agreement 
in place. On Figure 2.9, efficiency means the final outcome will be on the bold line cutting 
the two axes at n* = 200. 

The sharing problem is thus summarized by the opportunity costs ns and tin (dis- 
agreement point) and the prize for cooperation 6 = n* - n (graphically, the distance be- 
tween the two anti-diagonals). 42 ® aka quasi-rent 43@ Any division of this prize among 
the two parties will satisfy the two rationality axioms; graphically the final outcome will 
be anywhere on the outer segment of the triangle with origin at the disagreement point. 
Of these many outcomes, only one treats both parties democratically or symmetrically; it 
is the Nash solution of equal prize sharing where farmer i receives Xt = Tii + \d for i = S,N. 
This apportionment is said to preserve the differences since %s-ns = x^-tin which means 
that the increase of farmer S's payoff due to the help of farmer N is equal to the increase 
of farmer AT's payoff due to the help of farmer S. 

Non-cooperative Approach: alternating offers 

We now turn to the descriptive analysis. What can we expect to happen between two ra- 
tional decision markers? One farmer, say S, may ask N to pay him 39 in oder to exchange 
a few days of work at harvesting time. Farmer N would be likely to accept since he would 
harvest his whole field for a gain of 80, then pay 39 and earn finally 41 which is more than 
40, the profit in autarky. Farmer S would then earn 120 + 39 = 159. The sum of profits is 
maximum at 200, thus efficiency has been achieved; notice however that the proposer, 
farmer S, grabs most of the prize for cooperation 6 thanks to his clever offer. The weak- 
ness of this reasoning is that there is no reason for farmer N to accept this offer since he 
could work the numbers himself and make an entirely symmetrical counter-offer where 
he would grab the prize 6. 

Rubinstein (1982) studies such a sequence of alternatives offers for the division of a 
monetary prize, normalized to unity, between S and N as follows: S asks a share q$ for 
himself which N can either accept or reject. In the latter case, after one period elapses, 
the roles are reversed and N asks q^ for himself. The alternating offers go on until one 
player agrees to the offer made to him. Let 6s and S^ be the discount factors (cf §19.1.2). 
Notice that a small value indicates impatience. 44 ® 

It can be shown that in the unique subgame perfect equilibrium, the first player to 
speak, say S, offers q* s - ^~g N s for himself which is accepted by N who gets the comple- 



mentary part of the unit prize (cf. proof). One observes that greater patience, 8s > <5jv, 
implies a greater payoff q* s > q* N (comparing S and N as first speakers). If players are 
equally patient, the first speaker earns q* = -^ whereas the other player gets Sq*. If 
both parties become infinitely patient, their share tend to 50% which is the cooperative 
Nash solution. 

Many agency situations studied in Part H display a principal relatively more impa- 
tient than the agent. This is the case for a government, a public body or a CEO on is- 
sues where swift decisions are expected by stakeholders (e.g., consumers, voters, clients) 
whereas the agent is a contractor who is not immediately threatened to go bankrupt by 
a delayed payment. The negotiation regarding the final payment associated with the 
delivery of the good or service turns often at the favor of the agent who holds-up the 
principal. 

Asymmetric information 

According to a popular proverb, "virtue is in the eye of the beholder". This irrational 
feature of human nature (cf. §1.4.2), also called "self-serving bias", may lead negotiat- 
ing parties to exaggerate the importance of their contribution to the joint benefit up to 
the point where the sum of "apparent" opportunity costs overshoots it, thereby making 
impossible an agreement. This problem may be re-interpreted as one of asymmetric 
information because parties are unsure of their opportunity cost or of the value of coop- 
eration. 

More generally, when information relative to the bargaining is asymmetrically dis- 
tributed, the negotiation may fail so that the exchange is not performed when it would 
be efficient to do so. This frequent situation generates ex-post inefficiency i.e., waiting 
to resolve issues as they unfold does not always result in a Pareto efficient outcome as 
claimed by the Coase theorem studied in the next paragraph. Nevertheless, in most of 
the book, we overlook this problem and assume on the contrary that parties to a bargain- 
ing are able to trike an efficient deal. 

We present below a simple case of bargaining with one sided private information and 
show that the result is not always efficient. Dealing with asymmetric information on 
both sides is more involved and thus deferred to §22.3.3 in the auctions chapter. 

Consider an agent A who operates an asset of unitary value on behalf of a principal 
B (boss). 45@ Due to the complexity of the environment, it is not possible to specify ex- 
ante the precise service that A must render to B by operating the asset, nor the price to 
be paid for that. Ex-post, once all relevant information is known, B specifies the exact 
service she expects from A and makes a payment offer. The agent can either accept the 
offer or reject it and sell instead his services to an alternative employer. However, the 



agent has previously invested into the asset to improve its value within the relationship 
with B, thus the payment A gets in his best alternative is only 8 < 1. Equivalently one 
can adopt a rent-seeking view of the negotiation and assume that when A contests B's 
offer, he is able to divert the asset (e.g., steal) and recover a share of its unitary value 
for himself; in the process 1-6 of value is destroyed. The information asymmetry lies in 
the fact that A has private knowledge of 6 whereas from fi's point of view, this value is 
uniformly distributed over [0;1]. 

Since A can guarantee himself 6, he will accept fi's payment proposal p only if p > 0. 
The probability of this event being p, B's profit is p{\ - p) which is maximum for p -\. 
This choice means that A refuses the offer half of the time. In that case, he contests 
the offer to realize his larger default payoff 0. The agent's expected default payoff is thus 
E[max{0,i}] = \\ + \\ = §. The expected surplus is ±E[l|0<±] +\E[e\6>\\ = | + | = |<1 
i.e., there is an inefficiency due to asymmetric information. 

Under symmetric information regarding 0, the boss pays exactly 6 so that negotiation 
never fails and there is efficiency. The agent's payoff is then E[0] = |, thus the information 
advantage translates into a pecuniary advantage at the cost of a lessening of efficiency 
because the principal's profit drops from \ to \. The result remains qualitatively correct 
when if the (informed) agent has some bargaining power and is able to make a "take-it- 
or-leave-it" offer to B with probability a (as long as a < 1). 

Coase Theorem 

The theoretical and political issues surrounding the Coase Theorem are developed in 
§8.1.3. We contend here with its relation to bargaining. The basic situation considered 
by Coase (1960) is when firm JV, while producing generates an externality, positive or 
negative, affecting firm S's profits without generating a countervailing transaction. In 
the absence of communication, firm JV will maximize profits and ignore the externality. 
This outcome defines opportunity costs n$ and tin- If now a technological or institutional 
improvement enables parties to communicate and negotiate at no cost, they will be able 
to internalize the externality and restore efficiency; their joint profit will rise. 46@ In this 
ideal world, no State intervention is warranted. 47 ® 

Making firm JV liable for the externality caused to firm S only changes the opportunity 
costs, not the final efficient outcome. Observe indeed that under this new rule, firm S 
can guarantee itself a greater default profit n s > ns by requiring that firm JV sets the 
externality at the level optimal for him. However, this particular solution ignores that 
this restriction cuts back firm JV's profits down to n' N < tin- As illustrated on Figure 
2.9, the sum of profits may increase but it still falls short of the maximum because the 
externality level has not been chosen to equate its marginal value to marginal cost. To 



conclude, the externality can be internalized only if parties are able to communicate 
either directly or through some institutional arrangement. 

2.4.4 Free Riding & Social Dilemma 

A social dilemma is a situation where following individual rationality (IR) leads to a 
worse outcome than following social rationality (SR). Such a situation may appear in a 
public or a private setting. The former case is the most celebrated and also the more 
intuitive one; it is presented first. 

Public setting 

The archetypal case is a public service or public good (cf §16.1.1) such as Hume (1740 )'s 
draining of a meadow or Rousseau (1755)'s stag hunt (cf. §2.4.1). It is well agreed that 
each who stands to enjoy the public work should contribute to it but it is also well un- 
derstood that no-one will miss the opportunity to get away with the shore; this is the 
essence of free riding. 

At the family or village level (small group), peer pressure forces everyone to behave 
i.e., to follow SR instead of IR, because each and everyone becomes a controller of the 
proper contributive behavior of others. 48@ At the city or country level (large groups), 
collective action is harder to implement as it is comparatively easier to shirk and im- 
plicit control become diluted if not ineffective altogether. Authorities have, from time 
immemorial, solved this conundrum by forcing everyone to contribute time or money to 
the construction and maintenance of public works; later on, in kind coerced contribution 
is replace by taxation which allows to fund the modern State' activities. Noted author 
de Condorcet (1788) explicitly justifies coercive taxation to avoid free riding. 

Private setting 

Social dilemma are also pervasive in private markets but are referred to as prisoner 
dilemma. They are a great concern for Industrial Organization since they relate to the 
efficiency of market equilibria and the resulting justification for State regulation. As we 
show formally in §9.1, the best for an industry selling a standard private good or service 
is to coordinate marketing strategies so as to implement the monopoly price and earn the 
associated maximum profit (which is then apportioned among members appropriately). 
Yet, every single member firm deviates from the cartel agreement if she follows solely 
her short term interest. Depending on how we model the strategic interaction between 
firms, the ensuing equilibrium lies between the Cournot and the perfectly competitive 



outcomes. What really matters though is that all firms end up earning less (when com- 
pared to the successful cartel). Unless the industry, as a collectivity of economic agents, 
finds a way to discipline its members, it will suffer a prisoner dilemma and this consti- 
tutes the essence of the social dilemma in 10. 

Misconception 

Tuck (2008) shows that until the advent of game theory c. 1930, economists believe that 
rationality is socially based i.e., the bonding of the individual with his peer group is so 
complete that he takes the group's objective as his own. People or firms acting differently, 
typically in a selfish manner, are deemed irrational; they are either fools, ignorants (of 
the workings of the market), uninformed (about other firms), unable to coordinate (be- 
cause of the large number of participants) or overemphasize the short term. 49@ This 
misconception means that these authors see social dilemma as episodes of bounded ra- 
tionality. Now, if these bad outcomes result from the imperfections of man, there is 
the possibility to improve ourselves with education; the obnoxious IR behavior will then 
recede forever. It is our opinion that this lasting misunderstanding of the true forces 
underlying social dilemma have lead economists to neglect this aspect of economic inter- 
action. Thanks to Olson (1965)'s book on the logic of collective action, social dilemma are 
now at the center stage in economics. 

We conclude this section with a recollection of the opinions held by the important 
10 thinkers with respect to competition and how the modern conception used in this 
book emerged. Smith (1776) believes that oligopolists will take every opportunity to 
collude. He sees price wars, an instance of IR, as irrational behavior. His hopes to 
uphold competition therefore rest with freedom of entry 50@ Mill (1848) exposes neatly 
the free rider problem faced by a labor union but refuses to see the individual worker 
behavior as rational; rather he calls for the law to enforce the union agreement on the 
ground that SR is the real objective of the workers. 

Cournot (1838)'s model of duopolisitic competition based on IR behavior contains the 
first formalization of the prisoner dilemma (cf §2.4.1) but since the author ignores 
altogether the potential for cartelization, he does not address the underlying social 
dilemma. 51@ This change of paradigm from SR to IR is not well accepted. Bertrand 
(1883), for instance, is of the opinion that in such a market, there is either collusion 
leading to monopoly or a price war leading to perfect competition. The latter outcome is 
however deemed "unwise" given that the short-term benefit of free riding on the cartel 
(with a price cut) is always lesser than the long-term benefits of collusion. 

Edgeworth (1881) conclude his bargaining theory with the belief that collusion, not 
competition is rational for firms. Pareto (1896-97) makes a progress by assuming that 



economic agents either follow SR or IR, the discriminating criteria being the size of their 
group (industry, community). Marshall (1890) stresses the benefits of combination, par- 
ticularly through their ability to capture scale economies. Knight (1921) also believes 
that cartelization is the inescapable outcome when the conditions of perfect competition 
hold. Italian and Scandinavian public economists at the turn of the XX th century also 
view free riding as a pathological behavior and, in any case, believe that a unanimous 
consent can be forged for financing public endeavors (cf Musgrave (1939)'s account). 
Like de Borda (1781)'s voting method made to be used by gentlemen, these authors still 
believe in a social form of rationality. In a nutshell, they hold as inconsistent a person 
who values an outcome but do not to wish to contribute towards it. 

Eventually, Chamberlin (1929) (better know as Chamberlin (1933)) introduces the 
concept of oligopoly for competition between a limited number of independent firms that 
fail to internalize the negative externalities their market behavior impose onto others. 
He thus argues that monopoly can be sustained, not because firms follow SR but because 
when following IR, they have the means to retaliate to a price cut or any behavior threat- 
ening monopolization. He also concludes that perfect competition will arise once there 
are so many firms that each can safely sell as much as it wishes because the impact unto 
others will be too small to trigger retaliation. 



PartB 



Market Power 



Chapter 3 
Monopoly 



In his book Politics, the Greek philosopher Aristotle juxtaposes "sole" and "seller" to cre- 
ate monopoly, a concept he proceeds to illustrate with a tale: the clever mathematician 
Thales, to demonstrate his business wit, studied astronomy to foresee a soon to come 
abundant crop of olives, he then contracted out the services of all the olives presses in 
the city and made a huge profit by subletting the presses at a high price to heedless 
farmers when harvest came. Aristotle concludes that the acquisition of monopoly is the 
natural aim of any rational businessman, a claim we shall confirm. 1 ® 

In the first section, we define precisely monopoly and market power in economic lan- 
guage; then, we characterize the optimal behavior of a pure monopolist before assessing 
the welfare consequences of the exercise of market power. Lastly, we contemplate a 
number of extension of this basic theory. 

3.1 Optimal Behavior 

3.1.1 Typology 

A monopoly can be either de facto or de jure, that is to say either conquered from con- 
tenders or bestowed by public authorities. Among the latter we distinguish among pri- 
vate, regulated and public monopolies. Let us explain the distinctions through a series 
of examples. 

• de facto: The Boeing 747 held a monopoly over the long-haul high capacity aircraft 

market for 35 years before the appearance of the Airbus 380 contender because no 
other aircraft maker was able to produce a comparable plane. 2 ® 

• de jure private: The patented drug Nurofen held an almost worldwide monopoly over 

the painkiller market for 20 years because patents are recognized and enforced by 
almost all countries; patents are studied in §12 on R&D. 



• de jure regulated: In most cities, taxicab services is not a perfectly competitive market 

because the city council delivers a limited number of licenses and sets the tariffs. 
Whether there are many individual drivers or one firm operating a fleet, we face a 
regulated monopoly as far as economic efficiency is concerned. 

• de jure public: When the water network of a city is not operated by a private regulated 

firm but by a municipal agency, we face a public monopoly. Postal services is an- 
other example of public monopoly at the national scale. 

The first two categories can be brought together under the heading of private mono- 
poly which permits to drop the "de jure" prefix from the last two categories and derive 
a new classification based on the objective and constraints faced by the firm which fits 
better our economic study: 

• The private monopoly is free to seek profit. 

• The regulated monopoly is constrained in his pursuit of profit. 

• The public monopoly cares for welfare. 

3.1.2 Dominant Position 

There are few cases of pure de facto monopolies but in many markets there is a domi- 
nant firm; an overt example is Microsoft over the market for operating systems (OS) of 
personal computers (PC). The European Commission clusters these two cases into the 
concept of dominant position whereby a firm is able to behave independently of its com- 
petitors, customers, suppliers and of final consumers. Notice further that a monopoly 
exists only insofar there is no close substitute to its product; for instance, the Nurofen 
painkiller has imperfect subtitutes like Aspirin or Paracetamol. As we shall see, focus- 
ing on the pure monopoly case involves no loss of generality to characterize the optimal 
behavior of firms holding market power (or dominant positions). 3@ Likewise, the monop- 
sony (market with a sole buyer) being an entirely symmetric situation, we do not ascribe 
it much space (cf §3.2.3). 

It will be shown that with respect to the benchmark case of perfect competition,the 
monopoly earns an additional profit called the rent. 4@ This prize may motivate a de facto 
monopoly to abuse its dominant position to maintain it i.e., block entry or eradicate com- 
petition by anti-competitive means. This issue is taken on in Part D on antitrust laws. 
In this respect, recall that in the vernacular language, "monopolize" is synonymous of 
unfair exploitation or conspiracy to raise prices unduly and restrict output; it used to 



be judged as sinful as usury. Modern laws are thus an economic update of the Chris- 
tian doctrine according to which only two price setting methods are acceptable, perfect 
competition and the fair price set by public authorities. 

For a regulated monopoly, the rent is set by the regulator so that the firm may be 
tempted to sway (e.g., lobby) her to increase the size of the rent, an issue we study in 
§16.3 on rent-seeking. 

3.1.3 Market Power 

In the remnant of this chapter, we deal with a private monopoly; regulated and public 
monopolies are the object of §17. 

Awareness 

Monopolies and competitive firms are similar in many respects. Their profit functions 
both read pq-C{q) where p is the market price and q are the individual sales. Further- 
more, both understand that the market demand D is a decreasing function of the market 
price p. At a given price p, no more than Q = D(p) units can be sold. Inversely, the de- 
mand will absorb Q units only if the price is less than p = P{Q), the (market) willingness 
to pay. 

The only but fundamental difference lies in the relation between the price and the 
firm's sales. By definition, the supply q of a perfectly competitive firm is small in front 
of the total market sales Q and has therefore almost no effect on price, thus this firm 
rationally chooses to act as a price taker. For her, the price p = P{Q) is unrelated to her 
own sales q. 

The monopoly, on the other hand, is aware of his market power, the simple fact that 
his sales constitute the entirety of market sales i.e., Q = q. The equilibrium price is thus 
p = P{q) and is a direct consequence to his sales decision q; this permit to rewrite the 
monopolist's profit as 

U M (q) = P(q)q-C(q) (3.1) 

Since price and quantity are linked by the demand equation, we could as well see the 
monopoly profit as a function of the price he wants to charge, which is more or less the 
way real firms behave. It is however easier to work out the theory using quantity as the 
primary decision variable. Unless stated otherwise, the fixed cost of production is zero 
throughout this chapter, so that profit equals producer surplus. 



Intuition 

Before searching for the optimal quantity and price, let us try, using the left panel of 
Figure 3.1, to understand the motivation of a monopoly. Consider a raise from the com- 
petitive price p*; the rectangle grey area is the additional profit of selling the good at a 
higher price, being collected on all infra-marginal units it is called a volume effect. The 
small triangle, on the order hand, called the price effect, is the loss of profit generated 
by the loss of consumers; it is a negligible effect at p* because the area of the triangle is 
very small compared to that of the rectangle. 5@ Hence, 

| A monopoly finds it profitable to raise his price above the competitive level. 

The limit to the price increase by the monopoly is attained when the volume and 
price effects are equalized as shown by the grey areas on the right panel of Figure 3.1: 
the difference with the first case is that now, the consumers lost due to the price increase 
(those around q M ) were very profitable ones because the unit margin p-C m was large. 
The price effect now counterbalances the volume effect. We now proceed to characterize 
the optimal sales. 




Figure 3.1: Competitive and Monopolistic Markets 



3.1.4 Output Distortion 

Optimal Choice 

Formally, the optimal quantity q M maximizes the profit U M and since this function is 
concave, its maximum is reached when the derivative becomes nil. The derivative of the 



profit is called the marginal profit and since the profit is made of revenues minus cost, 
the marginal profit is the difference between marginal revenue and marginal cost. The 
maximum profit is thus reached when marginal profit is nil, that is to say when marginal 
cost and marginal revenue are equated. In formulas, this first order condition for profit 
maximization reads 

—— = 0o R m = C m (3.2) 

dq 

As we saw with equation (2.20), the marginal revenue is the price minus the marginal 
loss of consumer surplus: R m = P+W' D < P. We see on the right panel of Figure 3.1 that the 
R m curve slopes down faster than the demand curve, hence the monopoly quantity q M is 
lesser than the competitive one q* (the latter solves P = C m ). Given the negative relation 
between price and quantity, the monopoly price p M is larger than the competitive price 
p* , as we already saw using economic intuition. To conclude, 

A monopoly maximizes his profits by equating marginal revenue and marginal cost; 
as a result, he sells less than the efficient quantity at a price greater than marginal 
cost (the efficient price). 

Let us relate the optimality equation (3.2) to the previous graphical analysis. The 
gain from raising the price, the volume effect, is qAp while the loss of consumers is 6@ 
Aq = D'Ap = -£ which leads to a profit loss of Aq{P - C m ); this is the price effect. Starting 
from p* where P = C m , the gain clearly outweighs the loss (which is zero); the optimum 
is attained when both effects neutralize themselves i.e., when 

qAp = Aq(P - C m ) o qAp = Ap^^ oC m = P + qP' = R m (3.3) 

One should take care to avoid mixing the marginal changes shown by the grey areas 
with areas T + A and B that sum those marginal changes when the price rises from p* 
to p M . The change in profits for the monopoly in passing from p* to p M is the difference 
of areas T and B; we might wonder whether this quantity is positive. The answer is a 
clear yes provided by a revealed preferences argument: offering p M ^ p* reveals that the 
monopoly has found a way to increase profits for he could always pretend to be myopic 
and act as a price taker if it was better for him. The excess of profits T-B above those 
characterizing perfect competition is called an extraordinary profit or a quasi-rent; as 
we shall see in §17.3.1 on regulation, the presence of quasi-rents tells us that the capital 
invested into the monopoly has a return higher than the market average. 



I A monopoly obtains economic profits above the competitive level. 

Lerner Index 

Using the price elasticity of demand e = q p,f p] (cf. eq. 2.14), Lerner (1934) rewrites 
equation (3.2) characterizing the optimal monopoly price as 

R m = C m o ^ = ^^HL = i/ £ (3.4) 

V 

i.e., at the optimum, the Lerner index of market power if equates the inverse of the 
demand elasticity 7 ® 

This formula extends beyond the monopoly case to just any firm. Whatever the mar- 
ket structure (anything between monopoly and perfect competition), the demand ad- 
dressed to a firm i can be related to the price pt she charges, thereby leading the so- 
called residual demand function D t (which possesses an inverse Pi). Observe then, by 
going back to the analysis of optimal monopoly behavior, that the optimal pricing policy 
of firm i is to equate her Lerner index (cf. eq. 3.4) to the elasticity of her residual de- 
mand (i.e., change D by D t and P by Pi everywhere). In the monopoly case, the relevant 
elasticity is that of the aggregate demand while in a competitive market, it is infinite 
(recall indeed that a competitive firm can sell as much as she wants at the market price). 
We can thus conclude: 

I As the market structure evolves from monopoly to perfect competition, the ability of a 
firm to price above marginal cost is reduced from a maximum down to zero. 

The previous characterization is also useful to compare two markets like energy and 
entertainment. Whenever we own a gas owen, our ability to bake without gas is ex- 
tremely low, therefore our demand for gas is quite insensitive to its price. On the con- 
trary, if the toll for Pay-TV rises we can switch to one of the many substitute forms of 
entertainment at our disposal, thus our demand for a monopolized entertainment is sen- 
sitive to its price. Applying equation (3.4) tells us that energy firms with market power 
are able to price way above their marginal cost while entertainment firms are bound to 
be more competitive. This is why basic services like energy or water are regulated to 
avoid excessively high prices while hertzian TV is not. 

Numerical Illustrations 

Let us apply our findings to the numerical example used in the perfect competition sec- 



tion; the inverse demand function is P{q) - —r 1 while the cost function is C{q) - -%-. Leav 



ing detailed calculations as an exercise for the reader, equation (3.2) reads ^-^- - cq, so 
that we find q M = ^^ < q* = ^f^ for all n > 1 (the number of competitive firms). The 
optimal monopoly price is p M = ^2+fecl an ^ ^he Lerner index achieves the value 5£ = j^. 
For the case of a constant marginal cost c that we predominantly use in this book, the 
monopolist's FOC becomes ^ = c, so that we find q M = ^ and p M = ^. The efficient 
behavior 8@ is in that case to price at marginal cost i.e., p* = c from which we deduce the 
efficient quantity q* = a- be. At the monopoly equilibrium, the Lerner index (cf eq. 3.4) 
is ££ = |=|jj while profits are U M = -^{a- be) 2 and consumer surplus is Wjf = -^ (q M ) = 
^ [a - be) 2 . Finally, the (market) welfare is 

M 3 {a- be) 2 „ {a -be) 2 ,„ _. 

w m = ^ '_ <w * = i '- (3.5) 

8b lb 

the welfare at the Pareto optimum, which is the maximum achievable in this market 
(for given technology and demand). A relationship useful for empirical estimations is to 
observe that the deadweight loss 9@ is W* - W M = |n M . Checking the exactness of these 
formulas is a fruitful exercise for the reader. To conclude, 

| A monopoly does not behave efficiently and generates a loss of welfare. 

The output distortion and its consequences in terms of welfare will be commented at 
length in the next section. 



3.2 Inefficiency 

As we shall see in this section, the unrestricted behavior of a monopoly is inefficient in 
the sense that it would be possible in an ideal world to achieve higher surplus with the 
same technology while meeting the same needs, von Pufendorf (1672) tells the archety- 
pal case of the Dutch East India Company who destroyed spice crops in various parts of 
India in order to prevent an over-supply 10@ The purely economical result we work out in 
this section rationalizes the predating moral view according to which monopoly is "evil" 
and must be reigned in (cf. §8.2.4). The public attitude towards monopoly is the object of 
chapter 17 on regulation. 

3.2.1 Welfare Loss 

As can be checked on the right panel of Figure 3.1, the switch from a perfectly competi- 
tive market (or any organization yielding efficiency) to a monopoly is accompanied by a 
wealth transfer T from consumers to monopoly and by a welfare loss. Indeed, we already 



saw with equation (2.22) that any price different from the competitive one p* generates 
less trade than the maximum achievable q* and consequently a welfare loss. With a 
monopoly welfare is W M = U M {q M ) + W D {q M ) < W* , the difference called the deadweight 
loss triangle being measured by area A+B. The loss of consumer surplus over the q* -q M 
units not consumed is A while B is the opportunity cost (profit loss) of not producing and 
not selling those units. 

The increase of producer surplus U M {q M ) -U*[q*) is called the monopoly rent and is 
measured graphically by area T-B. Whether the monopoly is a private firm or a public 
enterprise, it is ultimately own by consumers, thus the only social cost is the deadweight 
loss A + B caused by insufficient output. For a long time, economists have sticked to 
this point of view and have estimated empirically the welfare loss. The first study by 
Harberger (1954) using manufacturing data from the 1920s finds a welfare loss of j^% of 
GDP, more recent studies compute a greater figure (several percent of GDP). 

§17 studies the government intervention aimed at reducing the distortion brought 
about by unrestrained monopoly behavior. 

3.2.2 X-efficiency 

According to Hicks (1935), "the best of all monopoly profits is a quiet life", reflecting 
the idea that an avid young entrepreneur is bound to become a bourgeois once he has 
achieved industrial success. 11 ® Leibenstein (1966) takes on this simple observation to in- 
troduce the loose concept of X-efficiency 12@ according to which firms with market power 
have higher costs than competitive firms. It is argued that in the absence of competitive 
pressure, shareholders get a constant stream of dividends and employees are protected 
from losing their jobs. Nobody is therefore poised to improve the competitiveness of the 
firm (i.e., reduce production costs). Employees, managers and workers, have leeway or 
slack on how to do their jobs and fail to implement "cost-cutting" processes, innovative 
technologies or learn how to better serve demand; rather they all engage in rent-seeking 
conduct. Although this view was never backed by formal theory, the European Commis- 
sion argued in 1988 that "...the new competitive pressures brought about by the comple- 
tion of the internal market can be expected to ...produce appreciable gains in internal 
efficiency... [which will] constitute much of what can be called the dynamic effects of the 
internal market..." 

Several criticisms were addressed to this theory and accepted by its author. Firstly it 
is a sketchy story not supported by a game theoretical analysis of interactions inside the 
firm. In other words, it fails to explain why there is a change of behavior (with respect 
to cost minimization) when the environment changes from monopoly to competition. As 



we all know, the lust for profits is universal, thus equally strong for the shareholders 
of monopolies and competitive firms: why would an investor holding shares in two such 
companies take a different behavior in the boards of the two companies ? Why would 
he support "cost-cutting" in the competitive sector and "take-it-easy" in the monopolistic 
one ? 

Secondly, the X-inefficiency is said to arise because of a generalized slack in effort in 
the firm, the absence of adequate incentives towards cost minimization. As forcefully 
argued by Stigler (1976), the failure by managers to motivate employees or the failure 
by owners to motivate managers is not a waste; it is the rational outcome (a Nash equi- 
librium) of the utility maximization performed by every stakeholder in the firm. One 
must realize that the firm's technology is not limited to machines and land but also en- 
compasses the internal organization and the entrepreneur's ability; each firm therefore 
follows the neoclassical theory to produce an output belonging to its technological fron- 
tier. The only conclusion we can draw from Leibenstein (1966)'s examples is that some 
frontiers are higher than others (cf §2.1.2 on productive efficiency) and above all, that 
managerial innovations like piece rate remuneration and yardstick competition (e.g., 
"employee of the month" prize) are very effective. 

Raith (2003) explores the issue of product market competition and managerial in- 
centives to rationalize the diffuse idea that "more competition induces a better man- 
agement". He shows that if "more competition" means a more elastic demand or less 
differentiation, then market power is reduced; this, in turn, forces some firms to exit 
(recall that in the long run, there are no economic rents) so that the remaining firms sell 
more. As we show in §12.2.4, the optimal level of management effort (deemed R&D in 
that chapter) is proportional to sales; in the present case, more competition has raised 
the incentives toward effort i.e., better management is observed. The same occurs if the 
market size grows; although there is entry, each firm sells more and therefore provides 
more incentives to its managers. Surprisingly, a deregulation which generates entry 
in a formerly monopolistic market leads to a reduction of incentives (according to this 
theory), contrary to common wisdom and to what the X-efficiency thesis predicts. 

In the end, X-efficiency is best seen as a forerunner of the agency theory studied in 
Part H; it raised the attention of researchers on the inner logic of firms, on incentives, on 
the features of contracts used inside firms and how changes in organizational structure 
occur (cf. §13.1). Another argument (absent from the X-efficiency story) that might 
explain the higher costs observed in monopolistic industries is the presence of strong 
asymmetries of information across all market players. It is easier to perform relative 
evaluation of managers or divisions when the firm participates in a competitive market 
because there are many similar units to compare with (yardstick); estimates are then 



more accurate and burden managers with less risk. Hence, shareholders of monopolistic 
or oligopolistic industries face a more difficulty task to extract the valuable information 
in possession of their managers. §21.2.3 exposes this issue at length for the regulation 
of a public monopoly. 

X-efficiency is concerned with the internal organization of firms and the failure of 
those working inside to minimize productions cost. The source of X-inefficiency are 
incomplete contracting, effort discretion and misalignment of objectives. 

3.2.3 Extensions 

Multiple Products 

Although Volkswagen (VW) owns Seat (brand #1) and Audi (brand #2), these brands are 
competitors on the consumer market. As a firm with market power, Volkswagen must 
account for the possible substitutability or complementarity of these two brands to set 
its prices optimally. On top of this pricing issue, the company must also account for the 
fact that each brand is produced in a different plant (cf the multi-plant firm problem 
seen in §2.1.3). 

To understand how multiple products affect the pricing decision of firms, consider a 
monopoly offering products #1 and #2; since both demands depend on both prices, total 
profit is 

n(pi,p 2 ) = PiD\pi,p 2 ) + p 2 D 2 (pi,p 2 ) -C\D\ Pl ,p2)) - C 2 {D 2 {p l ,p 2 )) (3.6) 

so that the first order condition for price p\ is D 1 + p\D\ + p 2 D 2 = C] n D\ + C^D 2 (we denote 
D l . = |^-). After some algebraic manipulations we obtain 

a> 1 = J__^£H^ (3.7) 

£-11 CllPlD 1 



where ££\ is the Lerner index (cf. eq. 3.4) for price p\, e u = 15W p is the own-price 
elasticity of demand and e i2 = ^§rl-p- is the cross-price elasticity of demand. 

The interpretation is quite simple: if the two products are substitutes like automo- 
biles, then eu < so that S£ \ > j- which means that the optimal price is larger than 
without taking into account the sales externality. If, on the other hand, products are 
complements like computers and operating systems, then the optimal prices are smaller 
than if they were set by two independent firms. To conclude, 



I A monopoly lowers the prices of complement products (cross-subsidization) to boost 
sales while he raises the prices of substitute products to boost margins. 

Monopsony 

A monopsony occurs when there is a sole buyer for a product. The state is a legal monop- 
sony for the recruitment of civil servants while examples involving private firms and 
the labour market are for instance VolksWagen in the city of Wolsfburg, Germany or 
Michelin in the city of Clermont-Ferrand, France. 

As we previously saw in §2.1.5, firms have a decreasing marginal willingness to pay 
V m for additional units of factors, thus this is also true for a monopoly and for the labour 
factor. Conversely, the supply of labour L is an increasing function of wage w due to 
the leisure-work optimal arbitrage made by of consumers-workers. The inverse sup- 
ply function is denoted w{L) and the total wage expenditure is B{L) = Lw{L). Efficiency 
commands to increase employment until the productivity of labour V m equates the op- 
portunity cost w. The monopsony, on the other hand, maximizes V{L) - B{L) i.e., solves 
V m = B m = w + Lw > w which involves a quantity L M lesser than the efficient L* and a 
wage lesser than the competitive one w*. Such an outcome occurs because, to employ 
more people, the monopsony is forced to raise everybody else's wages, so that the margi- 
nal cost of labour is not w but the greater B m . To conclude, 

I A monopsony employs an inefficiently low amount of productive factor by offering a 
price lower than the efficient one; this generates a loss of welfare in the factor market. 

3.3 Quality 

In this section, we study the relationship of quality towards the exercise of market power. 
Quality is an attribute of a product or service that all consumers view as useful or 
desirable. The resistance of an automobile to a crash-test or the number of airbags are 
objective quality elements since their adequacy can be assessed by technical tests. The 
presence of air-conditioning or leather seats are also seen as quality additions, although 
the utility they bring is more difficult to quantify. In the end, the usefulness of any at- 
tribute is subjectively judged by consumers alone which means that quality is a demand 
side attribute of a product or service. Contrary to engineers' belief, supply side attributes 
such as technical prowess or costly parts do not bestow any quality upon an item. 13@ 



3.3.1 Monopoly and Quality 

Quality vs. Cost 

We assume for the purpose of our study the existence of an index s taking into account 
all quality characteristics of the product; this way quality ranges from a minimum to a 
maximum. Since cheap and obvious quality additions to a product are common knowl- 
edge, increasing the quality index requires investments into market analysis to discover 
what new features (or altogether new products) consumers would like to possess. Then, 
one has to invest into R&D to develop, test and add such features to the current line of 
products. Finally, one has to advertise the novelties to establish leadership upon com- 
petitors (and reap the rewards through a higher selling price). It is generally accepted 
that all three steps of product development display decreasing returns to scale i.e., if k 
denotes the investment into any of these activities, the final quality index is s = f{k) with 
/" < < /' and where /(0) is the quality of the standard (minimum quality) product. 

A client assesses a product in terms of the value it delivers. We may assume that 
the latter is v = qs where q is the quantity of product consumed. Since the investment k 
impacts mostly the quality perception of the product, we may, in a first approximation, 
assume it has no impact on production cost i.e., the cost of producing q units is c{q) 
irrespective of the investment k. The cost of delivering final value v is thus cf-^j). We 
may then interpret the investment into product development (market research, R&D, 
advertising) as an effort to reduce cost. Hence, we can study w.l.o.g. R&D in §12 as 
a cost reduction activity rather than a demand booster activity. We apply this trick 
because viewing R&D as a supply side activity makes it much more amenable to the 
study of oligopolistic competition. 

As we explain in §11, two products of different quality are said to be vertically dif- 
ferentiated whereas two products with distinct characteristics that appeal to different 
segments of the market are said to be horizontally differentiated. The monopoly attitude 
wrt. variety is treated in §6.1.4 and leads to a conclusion endorsed by casual observation: 

Firms with market power offer too much variety (brands, shops) given that the fixed 
cost of establishing a variety although heterogenous consumers long for many differ- 
ent ideal varieties. 

Optimal Quality 

Many people intuitively believe that the quality of a commodity or of a service like water 
distribution can never be too high. They err because in the process of providing quality, 
we are sacrificing scarce resources which would produce more service to the economy in 



some alternative use (either in another public service or in production of private goods). 
This occurs as a consequence of the decreasing returns to scale of the technology used 
to improve quality. What is then the behavior of a monopoly with respect to quality 
selection? Is it too low or too high in comparison with an efficient level? To answer this 
question, we assume that the quality index s impacts demand D{s,p) in the following 
manner: ^7 > > |-y: an increase in quality raises the demand but with a fading force. 
For a monopoly market structure, and ignoring fixed costs, welfare is 

W = W D + W S = I D(s,x) dx + n 
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Contingent on a price level (whether p M , p* or anything else), we have 
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Hence, once the monopolist sets s so as to satisfy §f = 0, the welfare can still be increased 
by raising quality further. 14 ® 

Contingent on the market price, a monopoly under provides quality which rational- 
izes the frequently observed minimum quality requirements imposed by regulators on 
firms whose price or production is regulated (cf. §16.3). 

The above conclusion is partial because the monopoly and the planner (who maxi- 
mizes welfare) choose different prices, thereby making impossible an absolute judgment 
with respect to quality. To go further, we have to study in more detail the willingness to 
pay P{s,q) which now depends on quality. 

When j-^- s < i.e., the WTP for added quality ^ falls as consumption increases, then 
quality and quantity are said to be substitutes; an example is clothing because if you 
switch to a higher quality, you need to replace less often your wardrobe, thus your WTP 
for a fifth pair of shoes is lesser than it used to be. If the reverse inequality ££ > 
holds, then quality and quantity are said to be complements as could be the case for food 
because the WTP for a better restaurant is higher for people who go out dinning often 
(than for those who go out rarely). 

When quality and quantity are substitutes, the general conclusion reached by Spence 
(1975) depends on the elasticity of demand e which measures the innate market power 
of the monopoly. If e is large, the monopoly and efficient quantity are not too different 
so that the quality ranking is not changed: the monopoly under provides quality. If, on 
the contrary, e is low (e.g., electricity) then the monopoly provides an excessively high 
quality. 15@ These findings are reversed when quality and quantity are complements. 



Table 3.1 summarizes our findings. 



market elasticity is 


high 


low 


quality and quantity are 


substitutes 


s M is insufficient 


s M can be excessive 


complements 


s M can be excessive 


s M is insufficient 



Table 3.1: Quality selection by a monopolist 

The previous findings shed light on some practices, for instance the policy of an opera 
house or cultural business. The private operator caters to the marginal client (an occa- 
sional opera buff) who is less knowledgeable than the average client (an aficionado); he 
thus chooses a familiar repertoire whereas a public operator who caters to the average 
client will choose a wider selection of works and authors. This is empirically validated 
when looking at the restricted repertoire of privately promoted classical music shows 
wrt. government sponsored ones. The same analysis goes on to all the recreational ac- 
tivities (media, culture). 



Linear Setting 

Mussa and Rosen (1978) offer a very simple treatment of quality choice by a monopolist 
that serves as a basis for the study of competition and differentiation. 

• Quality s ranges from zero to some maximum, normalized to unity. 16@ 

• Clients are considering whether to buy a single unit. 

• Heterogeneity: the WTP 6 for the top quality ranges from zero to some maximum, 
again normalized to unity (cf extension later on). 

• WTP is uniformly distributed; total market mass is unity. 

• A client with type 6 has WTP 6s for quality s e [0; 1]. 

• Zero production cost. 

2 

• Cost of designing quality s is y. 

When offered at price p, the item is sold to all types with 0s>p. Demand is thus made 
of all the people with type between ^ and 1 i.e., q=l-j. Since market WTP is P{s, q) = 
s{l-q), quantity and quality are substitutes (according to the previous classification). 
Consumer surplus is Wd = fp /s {6s-p)dd = - s j^~ p xdx = ^- s (s-p) 2 = \q 2 . Profit being n = pq, 
the optimal price is p M = f leading to sales of \, profit n M = | and welfare W M = y . 

2 

Taking into account the cost of designing quality, we maximize net profit U{s) = f -■§-; 
the optimal quality choice is then s M = \. Observe at this point that, since production 
cost are zero, efficiency commands to giveaway the item, so that demand is unity and 



welfare W* = ~. The efficient quality maximizing net welfare is s* = \, twice the optimal 
choice of the firm. We have thus shown that the monopolist under provides quality 
because uniform pricing forces her to forgo clients with a low WTP which then dampens 
her incentive to invest in quality. 

A simple extension is to allow for WTP to be distributed between zero and some upper 
bound 6 while keeping total market mass at unity. It is a matter of algebra to check that 
demand is then q = 1 - £- so that profit is maximum for p M = ^ which gives a maximum 

us ^ 

profit of n M = ^ and welfare W M = ^. Letting the cost for quality being |s 2 , the optimal 

-2 -2 

choice is s M = f* , leading to net profit ^ and net welfare -jL^. Exactly as before, the 
efficient quality maximizing net welfare is twice larger than the monopolist's optimum 
and net welfare is also twice larger. 

3.3.2 Transportation f 

Frequency of service is a key quality attribute for mass transportation (e.g., airplane, 
train or bus). We apply the previous theoretical considerations to the specific case of a 
transport monopolist, say a train operator. 

Letting t be the time interval between two departures, the maximum waiting time is 
|, so the average is only |. If a consumer values, on average, 8 each hour lost waiting 
for the train, then his expected opportunity cost of waiting is ^. Letting T stand for the 
duration of the service period (e.g., 24h) and s for the frequency of carriers (the quality 
index), we have t = y, so that the unit cost of time is - s where y = ^f is the sole parameter 
we shall need to derive the optimal monopoly conduct. 

We assume that the market WTP for immediate service is the standard linear formula 
^Y-; the WTP contingent to a quality of service s is then P{q,s) = "^ - ^ as we subtract 
the opportunity cost of waiting. 17@ The gross utility of consumers is U{q, s) = Jq P{x, s) dx = 

q^-§ ™. In this setting, two cost dimensions emerge, the traditional one linked to 

volume and a new one linked to quality (frequency). Indeed, the cost of transporting q 
customers in s carriers is C{q,s) = 9s + cq where 6 is the cost of moving one carrier and c 
the cost of moving one passenger. 18@ The monopolist's profit is now easily computed as 

a — a ra 

n = qP(q, s) - C(q, s) = q - — -cq-Os (3.8) 

b s 

while welfare is W = U -C. The optimal monopoly quantity, conditional on quality s, 
solves the FOC 

^ = 0oR m = C m ^q M = l[a-bc-^-) (3.9) 
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while the efficient quantity solves |^- = ^ o P{q,s) - C m yielding q* = 2q M as in the 



standard monopoly set-up since we are assuming the same quality. 
The optimal quality chosen by the monopolist solves the FOC 

9 ^ = 0o s 2 = lq (3.10) 

which, in our simple additive model, 19@ is also equivalent to the FOC for an efficient 
quality ^ = |j. It turns out that the incentives for quality are the same for the monopo- 
list and the social planner but since the later chooses a greater quantity, he also ends-up 
choosing a greater quality (check in (3.10)). In turn, this implies q* > 2q M by a simple 
comparison of the FOCs for optimal quantities. The optimal carrier size, f = ^ by (3.10), 
is therefore larger at the efficient combination as compared to the monopoly's choice. 

To conclude, let us inquire how changes in carrier cost 6, opportunity cost of time y 
or market size a- bc 20@ impact the quality choice of the monopoly. Using (3.9), we can 
rewrite (3.10) as an equality between marginal benefit and marginal cost: 

a-bc-^=2q M = — s 2 (3.11) 

s y 

Since s > 1, the LHS is an almost flat curve while the RHS is convex increasing. Ne- 



glecting ^ in the LHS, an approximate solution is s M - y r ~ 6 . Hence, a smaller train 
cost (9 \), a greater opportunity cost of time (7 /) or a larger natural demand (a-bc /) 
leads to greater frequency s M , greater traffic q M and, due to (3.10), a greater carrier 
size to achieve these choices. Since q* - 2q M , (3.10) implies that the efficient frequencies 
and carrier sizes are approximatively 44% larger than those optimal for the monopoly 
(multiplied by a factor \fl). 

3.3.3 Monopolistic Taxation 

Another application of the monopoly theory of quality choice is the provision of pub- 
lic goods and services by the State using taxation as developed by McGuire and Olson 
(1996). We are dealing here with public goods such as national defense or the security 
of property rights and public services such as infrastructures, education or health (cf 
§16.1.1 for precise definitions). These goods and services reduce the uncertainty of eco- 
nomic endeavors and many cost items, thus facilitate trade and investment so that in the 
end the economy grows faster. 21 ® The fundamental parameter is then which objective 
pursues the State. Analytically, a more or less representative elite taxes the economy; it 
runs from despotic appropriation to benevolent planning at the hand of a democratically 
elected government. 22 ® We also perform comparative statics over the size of the elite 



(without however explaining how it can grow). 

Leviathan 

Consider first the case of a single decision-maker, the ruler, 23@ holding power over a 
territory producing taxables riches. Concretely, one may think of a lord or a small clique 
whose source of power is the absolute control over security forces such as police and/or 
the army (cf §16.2.4). The ruler can use part of tax revenues to finance public goods in 
anticipation that the local economy will grow richer and thus become a larger tax base 
that can be further plundered. The tax base takes the role of market demand and is 
therefore denoted D{p,s). The tax rate is p (i.e., market price) while and the amount of 
public goods s is akin to a quality index (measured in monetary units). That D decreases 
with p reflects the disincentive that taxation creates on producers who anticipate the 
lower return of their investments. 24@ That D increases with s reflects the productivity 
enhancement brought about by public goods. 

The first-best choice maximizes welfare W = D{p,s) - s. 25@ It is immediate to observe 
that efficiency calls for the elimination of taxes (set p = 0) to avoid distorting private 
activity and to choose the public goods amount s* solving §7 =1. The ruler, on the 

oi p=0 

other hand, seeks to maximize her net wealth n = pD{p,s) - s under the financing con- 
straint of public goods n > 0. To clarify the exposition, we assume D{p,s) = r(p)®(s) where 
O(s) is the aggregate maximum production in a free-market economy with a level s of 
public goods and r{p) is the net labour supply that producers choose upon anticipating 
taxation of their returns at the rate p. We assume decreasing returns to scale (DRS) wrt. 
production with O" < < O'. Regarding the reaction of producers to taxation, we assume 

• r(0) = 1: without taxes, work incentives are perfect and the aggregate maximum 
production is achieved. 

• r' , r" < 0: higher taxes generate fewer production at an increasing rate. 

• r(l) = 0: if all the tax base is taken away, no taxable output is produced because 
workers are either gone or dead. 

In this very simple setting, the efficient level of public goods solves O' = 1 while the 
ruler maximizes n = pr{p)Q>{s) - s i.e., chooses the tax rate p° maximizing the net return 
R (p) = pr{p) < 1. Since RoiO) = = Rq{1), the graph of Rq is bell-shaped, thus reaches an 
interior maximum at the solution of the FOC p = -fr, as shown early on by Khaldun 
(1377) (it is also known as the Laffer curve). The optimal amount of public goods s° then 
solves O'(s) = R * 0) - 26@ Since the RHS is greater than unity, there is under-investment 
(wrt. first-best) because the ruler grabs only a share Ro{p°) of potential output. The 
crowd's per-capita utility is u = (1 - p°)r(p )O(s°). 



Clique or Elite 

The ruler is now replaced by an elite making up a share a of the population; its income 
thus comprises taxation revenue and net-of-tax producer surplus i.e., its objective is now 
n a = n + a(l - p)r{p)®{s) while the public finance constraint remains R (p)Q>{s) > s. Since 
the maximum of {l-p)r{p) is achieved for zero tax (p = 0), the maximization of n a involves 
an intermediate tax rate p a between and p° i.e., the elite has an encompassing interest 
for the economy and restraints itself from too much taxation. 

Observing that n a = R a {p)<${s) - s where R a {p) = [p + a{\ - p)) r{p) > Roip), the optimal 
tax solves R' a {p) = o p = -jr - -^ which shows, again, that the optimal tax decreases 
with the elite size a. The optimal amount of public goods s a then solves O'(s) = R j a] 
which leads to s a > s° i.e., now that the elite has an encompassing interest with the rest 
of the crowd, it favors public goods more than a lone ruler. 27@ The crowd's per-capita 
utility is now u a = (1 - p a )r{p a )Q>{s a ); it increases in all three dimensions, the Pareto 
possibility frontier rises (more public goods), their incentives are greater (due to less 
taxes) and more of the produce remains their own. 

For small a, the amount of public goods is low and is therefore easily financed i.e., 
in the neighborhood of a = 0, p a r{p a )Q>{s a ) > s a . However, at the other extreme, we have 
p 1 = 0, thus no resource at all to finance the large amount s 1 . This means that for large 
a, the budget restriction pr{p)<S>(s) > s is binding. Since the technology displays DRS, 
the average cost ^ is increasing, thus the equation pr{p) = ^ defines a positive re- 
lationship s = h{p). 28@ The elite's objective is now n a = [p + a{\ - p)) r{p)®{s) - pr{p)<&{s) = 
a{\- p)r{p)<5>{h{p)). Observe that the optimal choice is now independent of a i.e., the 
elite has grown so large that it is now "super encompassing" and treats the crowd as if 
it belonged to the elite. Since the term (1 - p)r{p) is decreasing while the term <S>[h[p)) 
is increasing, an interior solution p* exists together with the associated level of public 
goods s* = h{p*). 

As a matter of example, if we take r{p) = l-p 2 and O(s) = y/s, we find p a = ~ g+ 3 ( 4 1 g2 ~ ) 6g+3 
and s a = ^ while p* = | so that the super-encompassing minority starts at a = ± when 

p a _ *_29@ 

Monopolization and Rent-seeking 

Congleton and Lee (2008) enrich this model of public finance with an alternative mean 
of generating revenue for the ruler: the sale of monopoly licenses. It is easy to see that 
the monopolization process of the economy though the sale of an ever increasing number 
of licenses goes on up the point where the marginal revenue equates the marginal loss 
of general taxation generated by the lesser economic activity in the monopolized sectors 



of the economy. 

As we recall in §16.2.4, monopoly licensing was the main revenue channel for mer- 
cantilist governments in the pre-industrial area. Its relevance nowadays is weaker but 
still present. Some developing countries use it more or less explicitly (to turn around 
formal prohibitions). Even advanced countries recur to it occasionally, for instance when 
auctioning licenses for mobile phone telecommunications. More recently, cash-strapped 
Greece sold future income from airports and lottery to private investors in return for 
immediate cash. Similarly, the city of Chicago, in need of funds, privatized its parkings 
for a price widely believe to understate the net present value of the assets. Nowadays, 
the most widespread form of privilege sales are those paid in kind with political support 
(cf §16.3). 



Chapter 4 
Differential Pricing 



One speaks of "price discrimination" or "differential pricing" when a firm sells two simi- 
lar products with close marginal cost of production at different prices. The discriminat- 
ing characteristic can be the identity of the buyer, the calendar date, whether the cus- 
tomer is a new comer, the total volume bought, whether another item is bought jointly. 
A few examples will help to set the stage. The famous Coca-Cola brand offers a neat 
example using May 2010 Spanish prices; you can drink this beverage at the bar, from 
a can bought at an automatic dispenser or a variety of bottles and bundles found at a 
nearby store. 



Size (liter) 


.20 


.33 


12x.20 


12x.33 


.5 


1 


2 


2x2 


4x2 


Price (€) 


1.5 


1 


2.84 


5.93 


0.8 


0.97 


1.36 


2.36 


4.65 


€/liter 


6 


3 


2.36 


1.5 


1.6 


0.97 


0.68 


0.60 


0.58 



Table 4.1: Retail Prices for Coca-Cola 

Next, we've queried a major airline on the internet in November 2005 for two round- 
trips, Paris-— New- York and Paris^Madrid. 



Transatlantic 


price (€) 


Continental 


price (€) 


economy 1 month 


393 


week-end now 


190 


economy week end 


930 


week-end later 


103 


economy in week 


2230 


long stay 


260 


~ flexible 


3160 


one way now 


715 


~ business class 


5084 


in week later 


408 


~ first class 


8064 


during this week 


1000 



Table 4.2: Airline Tickets Prices 



The "new economy" offers another striking example of differential pricing. Odlyzko 
(2004a) estimates in Table 4.3 the revenue generated by a megabyte of digital connec- 
tion according to the use that is made of it; there is a factor of 25 millions between the 
cheapest and most expensive channels. 



Channel 


$/MB 


Cable TV 


0.00012 


ADSL internet 


0.025 


Voice Phone 


0.08 


Dial-up Internet 


0.33 


Cell phone 


3.50 


SMS 


3000 



Table 4.3: Revenue from digital services 

In this chapter, we go beyond the basic monopoly pricing exposed by Cournot (1838) to 
explore the refined use of market power as first analyzed by Dupuit (1844). Differential 
pricing takes advantage of the heterogeneity of customers' willingness to pay to design 
personalized prices. The idea is to be nearer to one's customers needs in order to charge 
them higher prices which is echoed in the old maxim that "one ought to tariff at what the 
demand will bear". Differential pricing was first systematically analyzed by Pigou (1920) 
under the heading of price discrimination (cf definition in §4.1.1). The complementary 
strategy of differentiation, studied thoroughly in Part E, is to be different from one's 
competitors through a diversification in quality and/or variety. 

The plan of the chapter is as follows. The first section extensively discusses the con- 
cept of discrimination, the conditions for its application and its basic instruments. The 
next section shows how a firm with market power uses the instruments to extract con- 
sumer surplus given its legal and informative environment. The last section tackles 
differential pricing when the firm is in the weakest position and shows that an intelli- 
gent design can lead clients to self select a category or segment upon which traditional 
market power can be applied. 

4.1 Premices 



It is noticeable that in most of the economics literature, price competition is understood 
as uniform or linear 1@ whereby the same price is charged for every unit sold. This 
definition thus excludes the popular strategies of volume discount or packages. Two 
reasons for this odd restriction can be advanced: the inertia of economic thought and the 
natural efficiency of economic relationships. Whereas the former does not warrant our 
attention, we shall deal with the latter: it is thanks to their market power that firms are 
able to escape uniform pricing and develop differential pricing. 

It is also customary in the literature on differential pricing to concentrate on revenue 
and thereby ignore cost. This practice would seem at odds with the underlying profit 
maximization objective of firms but it is almost without loss of generality regarding vari- 



able cost since we can integrate them in the revenue function which becomes net (of cost). 
Next, we observe that a private firm covers its fixed cost out of its producer's surplus and 
thus does not take those into account for the determination of its pricing strategy. The 
issue of costs however cannot be ignored when dealing with public or regulated firms. 
For that reason, chapter 17 on regulation will focus on cost and check the truth of an- 
other old maxim stating that "every tube must stand on its own". Lastly, our results on 
differential pricing shall equally apply to any firm with market power once the relevant 
elasticity of demand is used as we already argued when discussing the Lerner formula 
(cf eq. 3.4). 

4.1.1 Definition 

The vernacular definition of discrimination is an "unfair treatment of a person or group 
on the basis of prejudice" while the use in economics adheres to the original latin mean- 
ing which is to separate or distinguish. We shall use the expression differential pricing 
and the verb discriminate. 

A first observation about differential pricing is that it would never appear in a per- 
fectly competitive markets because the law of one price applies: buyers and sellers par- 
ticipating in a competitive market are bound to accept the market price and can only pro- 
pose a quantity to trade. Hence, it is necessary to hold a minimum of market power to be 
able to perform differential pricing. Whether this can be interpreted as anti-competitive 
is deferred to §9 on anti-competitive practices. 

In the previous chapter, we introduced the monopoly as a firm having the power to 
set the price while consumers retained their ability to set the quantity. This market 
relationship is quite similar to the "divide and choose" procedure to share a pie where 
one person cuts the cake in two pieces and lets the other person choose the piece she 
likes best. This story does not fit our intuitive representation of the monopoly as having 
"maximum market power"; rather one would expect him to make a "take-it-or-leave-it" 
offer to the helpless consumer in order to grab the greatest profit. This is what differen- 
tial pricing is about; finding contract proposals-tari/fs-that extract a maximum surplus 
from consumers. Practically, the firm uses the heterogeneity of their tastes to treat them 
differently. 

This economically founded discrimination (as opposed to that founded on moral prej- 
udice) can in theory be applied to human traits such as age, residence or occupation if 
it does not conflict with higher level laws protecting the fundamental rights of citizens. 
Even if the firm is prevented from applying such a direct discrimination, we shall demon- 
strate that it is still possible to discriminate by indirect means. This is why the concept 



of differential pricing should be remembered as "the sophisticated exercise of market 
power". 

Using the previous example of the novel, it is quite simple to recover the steps leading 
to the observed pricing behavior. The editor knows that different people have different 
WTP for the novel. He would like to be able to guess it, simply by looking at the client's 
face and then charge him/her that price. The first necessary step for successful discrim- 
ination is thus the gathering of information. Ideally, it is possible to identify the WTP of 
each and every single consumer; this is a limiting case never observed in reality but use- 
ful for the theory. Practically, discrimination is bound to be imperfect and uses statistics, 
market studies (sociology) and interviews (psychology) to segment the demand 2@ accord- 
ing to attributes like age, sex, occupation or the postal code (a good indicator of wealth, 
itself a good proxy of WTP). 

Next, when discriminatory pricing takes place, some consumers realize they are pay- 
ing more than others for the same good or service. Although the firm will always try to 
present the cheap price as a "special" discount over the regular price and not the reverse, 
these consumers will feel deceived and will rebel against this prejudice. 3@ A first solution 
is to sue the firm for discriminatory and unfair treatment. Historically, the discrimina- 
tion widely used by railways companies in the XIX th century for categories of freight 
or passengers generated public outrage, political reaction and governmental prohibi- 
tion. Later on, during the XX th century, discrimination based on human traits became 
unlawful in most countries. 4 ® We have thus obtained a second necessary condition for 
successful discrimination: legality. Yet, the legal framework for economic activity leaves 
firms free to set different prices for goods sold in different places, at different times or 
for different intended uses. For instance: 

• German cars are more expensive in Germany than in Italy. 

• Holiday packages are cheaper in September than in August. 

• Softwares are cheaper for home use than for professional use. 

• Mobile phone is cheaper for professional use than for home use. 

• Per capita income tax is cheaper for married couples. 

If discrimination is legal, arbitrage might nevertheless weakens it considerably since 
the lucky consumers who can buy the item at a bargain price might try to resale it to 
consumers whom the monopoly asks a high price. For instance: 

• Some Germans go to Milan to buy their German car. 

• Some people work in August and leave on vacation in September. 

• Some independents buy a home version of a software for professional use (hoping 
not to be sued for such an illegal practice). 



• Some people employ their professional mobile phone for personal use. 

• Couples get married to save on their tax bill. 

Arbitrage can even be such a threat that discrimination altogether disappear (as pre- 
dicted by the perfect competition model). Some internet vendors such as bookshops or 
airlines use computer cookies to detect if a visitor has already bought an item from them 
in which case they quote higher prices because they know that the visitor is serious 
about buying more items. It is enough to use a different browser or to erase cookies to 
perfectly bypass this intent of differential pricing. We shall come back later on the issue 
of arbitrage explaining especially how consumers intent to bypass discrimination and 
how firms intent to prevent arbitrage. 

4.1.2 Typology 

Table 4.4 summarizes our finding regarding the steps to follow in order to successfully 
perform differential pricing: 

• Differential Pricing: Extract consumer surplus by segmenting demand 

• Segmentation: Sort heterogeneity of tastes into homogeneous groups 

• Rent Extraction: Devise specific tariff for each segment 

• Necessary Conditions: Information, Legality and No-arbitrage 

Table 4.4: Stages of Differential Pricing 

Notice that segmentation refers to interpersonal price discrimination since the firm 
discriminates across consumers whereas rent extraction refers to intrapersonal price 
discrimination as the firm discriminates across units for the same consumer. 

The practical implementation runs into the difficulty of satisfying completely all three 
conditions of information, legality and no-arbitrage; some degree of incompleteness is 
often unavoidable. For instance, car insurance premium would be profitably based on 
the following binary classes: residence (city/countryside), gender (M/F), age (young/old) 
and accident records (some/none). This would lead to establish 16 different premiums. 
Yet the last characteristic is unobservable thus only 8 segments can be constructed. If 
a projected European directive 5 ® is adopted, gender discrimination will be prohibited, 
thus limiting discrimination to 4 segments. Finally, if most families involve young and 
mature people, they can always bypass the age discrimination (arbitrate) by sending the 
relevant member to apply for insurance. It might therefore be the case that the firm can 
only build two segments upon which (direct) discrimination is feasible i.e., city dwellers 
are being asked to pay a dearer premium for the same service. 6 ® 



It would seem that, within a segment where direct discrimination is impossible to 
carry out, there is nothing that the firm can do to boost profits. This is a very mislead- 
ing intuition because the firm can always resort to indirect discrimination. The idea is 
simply to propose the whole range of discriminatory proposals to everybody at the same 
time and let consumers pick the one they prefer, a behavior known as self -selection. The 
trick is to design these proposals appropriately in order that each consumer picks the 
proposal you would want him/her to choose. Those criteria are for instance: 

• Quantity: Unit price varies with the quantity bought 

• Differentiation: Several versions of the good are offered 

• Temporality: Price changes with seasons or with time 

Table 4.5: Forms of Indirect Price Discrimination 

We can summarize our typology with the help of Table 4.6. We adopt a logical ap- 
proach to discrimination reflecting the many theoretical developments of the last 20 
years; it is a departure from Pigou (1920)'s received typology where perfect discrimina- 
tion is refereed to as "first degree", indirect discrimination as "second degree" and direct 
discrimination as "third degree". 

• Perfect: Each segment is formed by homogeneous consumers 

• Imperfect: Segments contain heterogeneous consumers 

• Direct: Discriminate across segments 

• Indirect: Discriminate inside segments 

• Basic: Based on unit price only 

• Complex: Based on packages of price and {quantity or attribute} 

Table 4.6: Typology for Price Discrimination 

4.1.3 Consumer Surplus Extraction 

In this part, we look at all the methods available to extract the surplus of an individual 
consumer when his demand curve is known to the monopoly. We already know that 
these schemes have to be more complex than basic monopoly pricing since the latter 
leaves a net surplus to any of his clients. The three methods are the quantity rebate (or 
price listing), the club pricing (or two-part tariff) and the minimum purchase obligation 
(aka "take-it-or-leave-it" or "take or pay" offer). As explained in the introduction to this 
section, we focus on revenue and thus assume a very simple cost structure with constant 
marginal cost and zero fixed cost. We can already advance the result of this section: 



Conditional on information, legality and no-arbitrage, quantity rebates, club pricing or 
minimum purchase obligation are equivalent methods to rip off entirely the surplus of 
consumer. As a by-product, efficiency is restored when compared to standard monopoly 
pricing. 

Price Listings aka. Quantity Rebates 

Let us now introduce the first strategy to extract consumer surplus and demonstrate, 
after Pigou (1920), its efficiency using the movie rental example of §2.2.2 reproduced 
on the left panel of Figure 4.1. The client's WTP for the first, second, third ... movie 
rental enables to draw a staircase. Imagine now that the shop manager askes her to 
fill a survey in order to be able to guess her inverse demand curve (WTP) P{q) (dashed 
line). The seller can now make a personalized rental offer to the client: the first movie 
is proposed for the price p\ = P[l), the second one will be cheaper and is proposed at the 
price p 2 = P{2), the third unit is offered for a still lower price p 3 = P(3); this discounting 
process goes on until the WTP becomes smaller than the marginal cost c of a rental. The 
price of additional rentals then becomes equal to c. 

When offered this personalized price-listing, the client will elicit a number of movie 
rentals equating her WTP with c. 7@ The purchase is therefore the efficient quantity 
q* = D{c) while the profit of the monopoly is the area between the demand curve and the 
marginal cost curve i.e., the maximum market welfare W* (cf definition §2.3.2). For 
the case of linear demand D{p) = a-bp and constant marginal cost c, q* = a- be and 
W* = ±{a-bcf(ct eq. 3.5). 

Observe that the pricing list can be interpreted as a list of quantity rebates: the 
regular price is p\, a discount p 2 - p\ is offered on the second unit, a further discount 
P3 - p2 is offered on the third unit and so on. 

Two-part Tariff aka. Club Pricing 

Our second consumer surplus extraction strategy, first described by Hicks (1943) as the 
quantity compensating variation, involves a change of attitude: instead of renting ex- 
pensive movies inside a shop with free entrance it is wiser to charge an entry fee (make 
it a club) and then rent at a bargain price inside. 8@ 

To see why this is a good idea in terms of profits observe on the right panel of Figure 
4.1 that when the manager sets a per-unit price p, the consumer optimally buys Dip) 
units and derives a (consumer) surplus Wuip), the excess of her total WTP over her total 
expenditure. The consumer participation constraint is thus / < W^ip) and it is optimal 
for the manager to saturate this constraint i.e., ask his client an entry fee / as high as 
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Figure 4.1: Quantity Rebates and Club Pricing 



the derived consumer surplus W D {p). The per-capita profit is then n{f, p) = f +{p-c)D{p) = 
Woip) + pD(p) = W(p) by construction. As can be checked on Figure 4.1, meanwhile the 
price is greater than the marginal cost, there is an inefficiency generating a deadweight 
loss, which in this particular instance is also a profit loss. Formally, we have || = p - c. 
The optimal two-part tariff is thus to set p equal to the marginal cost c and / equal to the 
corresponding consumer surplus Wd(.c) = W* , which in this case is the maximum market 
welfare. 

The two-part tariff is commonly used by sport clubs or theme parks for kids; a rather 
expensive annual fee or daily entrance fee is charged but almost nothing is charged 
for the actual consumption of facilities or attractions. This is because these activities 
support a high fixed cost but a low marginal cost. Another instance where two-part 
pricing appears is for bundled goods involving a durable component and a disposable 
one like {camera + films} or {water filtering jar + mineral recharges} or {razor + blades}; 
the user must purchase one apparel to enjoy the service but can then freely decide how 
many disposable units to buy. We develop the analysis of bundling in §5.3.3 and §24.1. 

Lastly, most regulated services like telephone, gas, electricity, water are also priced 
using a two-part tariff. Coase (1946) advocates for the righteousness of this scheme 
as opposed to marginal cost pricing and public subsidies. The reasons advanced are 
the same as above, namely high fixed cost and low marginal cost; they and are further 
explored in §17 on regulation. 

Block Tariff An alternative pricing scheme also popular with public services is the 
block tariff consisting of one expensive price p\ for the units within the block of size q\ 
and a cheaper follow-on price p2- As shown by Gabor (1955), this method enables to 



extract the entire consumer surplus. 

Indeed, the final consumption elicited by the consumer will either equate his WTP P 
with p\ or p 2 . If the former case occurs, then we are back to linear pricing which is inca- 
pable of extracting all the consumer surplus, thus it must be the case that the consumer 
demands D{p 2 ). In order to extract W* , it is necessary to generate this maximum mar- 
ket welfare, thus necessary to induce efficient sales of q* which occurs only if p 2 = c, the 
marginal cost of production (recall D{c) = q*). The total revenue under the block tariff is 
then piqi + c(q* - q\) while total cost is cq*, the profit is therefore n = {p\- c)q\. To equate 
it with the maximum market welfare W* there are many solutions (we have a degree of 
freedom); we can pick some block size q\ and then set p\ = c+ ^-. If the consumer buys q\ 
units at price p\ plus q* - q\ units at marginal cost then the first block sales are cq\ + W* 
while the second block ones are c{q* - q\), so that total sales are W* + cq* leading to the 
desired profit of W* . 

Since the efficient quantity is q* , the gross surplus of the consumer is W* + cq* and 
by the very choice of the block tariff parameters, he pays exactly that amount, thus he 
will accept the block tariff offer and the firm will successfully rip the maximum welfare. 
The last possibility that must be set aside is that the consumer buys less than q\ units at 
the high price p\ which occurs if P(0) > p\ = c+^-. Since W* = ~q*{P{Q)-c), the condition 

Cfi <L 

we need is q\ < q* 12. Notice finally that choosing q x = 1 is nothing but a two-part tariff. 

The practical advantage of the block pricing method for regulated utilities with re- 
spect to a two-part tariff is to accompany the cycles of demand (cf §25.3 on peak-load 
pricing). Indeed, during the off-peak period, the client must pay the potentially large 
subscription of a two-part tariff even though he consumes few units and derives a small 
surplus; on the other hand, a block scheme enables him to buy the small number of units 
he needs, at the high price p\. 

"Take-it-or-leave-it" aka. Minimum Purchase Obligation 

The third strategy to extract consumer surplus is to bundle price and quantity and make 
a "take-it-or-leave-it" offer to the consumer. 9@ The fact that a quantity q is offered for 
a total price (transfer) t means that the consumer has to purchase a minimum number 
of units. Given our previous derivation of the optimal two-part tariff {p = c,f = W*), 
the optimal "take-it-or-leave-it" offer is simply to bundle the efficient quantity q* with 
a transfer equal to the consumer's utility (raw surplus) from consuming q* i.e., set t* = 
u{q*) = cq* + W* (this measure of utility u is defined in eq. (2.17)). Notice that t* is the 
total expenditure of the consumer under the optimal price listing and under the optimal 
two-part tariff. Once again, the optimal bundle leaves the consumer with a zero net 
surplus while the firm's profit reaches the maximum welfare. 



An equivalent contract is to offer a unit price of j- bundled with a minimum purchase 
obligation q, a scheme quite popular with mobile phone operators and manufacturers 
in the retail sector. Upon having accepted the offer \^*,q* , the client does not wish to 
consume more units since her WTP at q* is exactly c while the unit price is |r = c+^. 
Given that her expenditure is q* -^ = t* , she will accept the offer. 

Historically, Leontief (1946)'s departure point for the "take-it-or-leave-it" method was 
the comparison of the traditional wage negotiation where only the rate is agreed leaving 
the firm free to decide later on employment and a more innovative negotiation where a 
guaranteed wage is accorded forcing the firm to provide a minimum level of employment. 
He simply observed that the former was akin to a standard monopoly thus was ineffi- 
cient while the later was akin to a perfectly discriminating monopoly and demonstrated 
that the optimal bundle would be (Pareto) efficient. 10@ 



4.2 Direct Differential Pricing 

In Pigou (1920)'s language, this section addresses first and third degree price discrimi- 
nation which we deem perfect and imperfect separation of consumers within a market 
segment. The former case is rarely observed and serves as a benchmark for the latter. 

4.2.1 Perfect Discrimination 

It is obvious that if a monopoly is legally entitled to discriminate among his clients, holds 
all the necessary information on their preferences and can avoid arbitrage, he can rip the 
entire consumer surplus of each client using any of the previous three instruments. 11 ® 
The maximum welfare W* will be generated but contrarily to the perfect competition 
case, it will go entirely to the monopoly leaving no (net) surplus to consumers. 

Although not the most frequent form of price discrimination, there are real life sit- 
uations that approach this extreme case; for instance, when an expert seller bargains 
with a novice buyer over the price of a durable good such as a painting or a car, he can 
guess his WTP and offers him a rebate on an extremely high starting price that leaves 
zero net surplus to the buyer. Odlyzko (2004a) explains how the internet and telecom- 
munication (IT) revolution has made perfect price discrimination a near reality. Firstly, 
firms are able to guess our current WTP using our recorded past purchases and the in- 
formation we provide them when enrolling in fidelity programs. The yield management 
techniques initially developed by airlines and hotel chains are becoming extremely pre- 
cise (cf Netessine and Shumsky (2002) for an introduction). Secondly, IT technologies 
enable firm to dynamically change their prices at virtually no cost. The limit to this 



practice is the annoyance felt by consumers in the face of ever changing tariffs. 

The positive side of perfect discrimination wrt. the basic (non discriminating) mono- 
poly is the increase of consumption as it allows some intermediate income households to 
consume the good or service. Yet, the consumer surplus being nil, the law often forbids 
(perfect) discrimination to avoid such an extreme sharing of the welfare between pro- 
ducers (here the monopoly) and consumers. Notice also that perfect discrimination leads 
the monopoly to take efficient decisions in every respect but price. The costly quality of 
the product, its degree of complementarity with other goods will all be efficiently chosen 
because perfect discrimination makes the monopoly profit function equal to the objective 
function of a benevolent social planner. 

Let us stress the fact that the full extraction of consumer surplus is feasible only if 
the monopoly can guess the demand curve of each potential customer and is allowed 
to make personalized offers in either of the 3 above forms. It is also patent that our 
demonstration applies to any cost function and to any good or service for which the 
information and legality constraints are satisfied. 

4.2.2 Imperfect Discrimination aka "Segmentation" 

Differential pricing as understood by the general public is direct but imperfect price 
discrimination. We give motivate this practice with historical examples and then derive 
the optimal prices for each segment and the welfare consequences. We shall also see how 
the tools of consumer surplus extraction (complex pricing) can be used in the presence of 
heterogeneity within a segment. Because they provide a further advantage to the firm, 
these tools are sometimes forbidden, thus forcing the firm to use uniform pricing. Notice 
that when consumers only need a single unit, the price is by force uniform. This is the 
case for durable goods like cars or domestic appliances. 

Motivation 

As recalled in Odlyzko (2004b), differential pricing has been used for centuries in Europe 
and China in the area of transportation; for instance sea shipping, use of rivers, canals, 
turnpikes and then railways. Tolls depended on the type of vessel using the facility, the 
size of the vessel or the merchandise on board. The main defect for users was not so 
much the level of tariffs but their intricacy. The general rule that slowly emerged in 
successful ventures was to charge at "what the traffic would bear". 

Dupuit (1844) offers an early formal analysis of differential pricing as a mean to 
improve the profitability of bridges in France. His idea is to drop the single toll and 
establish instead a discriminatory tariff based either on clothing of the user or on the 



time of passage. In the first case, workers pay a low toll provided they wear a cap or any 
other cloth typical of their statute while other (richer) users pay a higher toll. 12 ® As a 
result of this scheme, the traffic on the bridge is increased as well as the total receipts; 
we can conclude that a more efficient situation has been reached because new users do 
no have to take a long detour, old users still pay the same price and owners now make 
profit from their bridge instead of incurring losses (under the low pricing scheme). 

Most often, market segments are exogenously determined by a criteria such as gen- 
der, age or geographical residence but as we shall show later on firms prefer to endoge- 
nously define segments along the WTP criteria, so as to come closer to perfect discrimi- 
nation. 

Exogenous Segments 

For some reasons, the market can be divided into segments upon which it is both possible 
and legit to apply differentiated prices. We thus treat each segment as a separate market 
assuming further that no arbitrage among them is possible. The optimal price in each 
segment therefore satisfies the Lerner formula (3.4) . If there are n segments, we obtain 
for i < n 

&i = Uei o [l-±)pt = C m (Q) (4.1) 

where Q = £;<„ is the quantity sold across all segments. This equation means that 
changes in elasticity are compensated by changes in the optimal price to maintain it 
equal to the marginal cost. Thus, a segment with a higher elasticity enjoys a lower price 
because it is more "combative" and forces the monopoly to offer a greater rebate. In the 
case of two segments, Robinson (1933) speaks of the strong and weak markets with re- 
spectively a higher and lower price wrt. the uniform optimal price (maximizing the sum 
of market profits). 

Equation (4.1) only characterizes differences in the treatment of each market. The 
vector of profit maximizing quantities q d = [qf)i< n is found by applying a simple algo- 
rithm inspired by the manna trick of §2.1.1. 13@ Assuming to simplify a constant marginal 
cost c, the monopoly profit is 

n(q) = L Vi i p i (#) - c ) = L f ' i R m,i W - c) dx (4.2) 

i<n i<n^° 

Given the quantities q = {qi)i< n already sold in each segment (initially zero), the firm 
ponders producing an additional unit. She computes the revenue RmjiqO she would 
derive from selling it to segment i and count the cost c as a special zero segment. The 
allocation rule is then to award the unit to the segment generating highest revenue, as 



if each segment was represented by an officer bidding for the additional unit; notice that 
the unit goes to the zero segment i.e., is not sold, if revenue bids are all below production 
cost. The analogy with an auction will be used in §22 on auctions. 

The analysis can be pursued graphically, considering for instance two markets; it is 
enough to equate q\ + q 2 = RmiW +^2^ with q = C~^{p) to obtain the total quantity 
q m and the marginal revenue r m (cf the right panel of Figure 4.2). Quantities are then 
given by q\ = R^ ^r™) and q 2 = ^2^™) while prices offered to consumers are p\ = P\{q\) 
and p2 = Piiqi)- Exercise: solve this problem for the linear demands D l {pi) = at - btpi for 
i = 1,2 and C m {q) = cq. 




qi q q2 q q m 

Figure 4.2: Optimal prices in segmented markets 



Welfare 

Obviously, differential pricing is weakly beneficial for the firm since she has always the 
option of not using it. 14@ Socially, it will be shown that this practice generates an alloca- 
tive inefficiency that may be compensated by an increased output (as monopolies price 
above marginal cost and sell to little). 

If demand is linear in all segments with Diipi) = a, - bipi and marginal cost constant 
then (4.1) is c = ^^- => qf = ^^ and price pf = ^. Notice now that q$ = Li< n qf 
is also the optimal quantity under uniform pricing since market demand is D {p) = 
T.i< n Di(p) = #o- boP with ao = T.i<n a i and bo = Y.i< n bi (it has the same FOC). 15@ The 
monopolist thus sells the same total amount whether she price discriminates or not. If 
she can, she earns more by having strong segments pay more and weak segment pay 
less. This is inefficient given that opening a second-hand market would allow mutually 
advantageous re-trade among segments. We thus obtain a necessary (but not sufficient) 
condition for welfare improvement: 



I Differential pricing increase welfare only if it increases output. 

Schwartz (1990) proves the claim even for decreasing cost, in two steps. Let q{f < qtf 
be total output under discriminatory and uniform pricing. We want to show W d < W u . 
Given the uniform price Piqfi), consumer express demand to maximize utility thus total 
utility (cf eq. (2.17)) is greater than if they face the segment prices pf associated with 
optimal discrimination. As cost only depends on output, welfare, which is utility minus 
cost, is greater if total output q* is sold uniformly. Next, we observe that at market level, 
the inequality q% < q^ implies w£ < W£ as consumer surplus rises with output, but also 
niqfi) < n(q{f) since the latter corresponds to the monopolist's ideal. The sum, which is 
welfare, must therefore increase. 

Applications 

Our findings also apply for foreign trade i.e., when a firm from country 2 sells the same 
good at different prices at home and abroad (country 1). The rationale of this pricing pol- 
icy is to grasp the heterogeneity of demand at home and abroad. If the home consumers 
are less reactive to price then p 2 > p\ and the abroad government (country 1) may accuse 
the firm of dumping (e.g., Japanese cars vs. Europe or USA). German cartels of metals 
and chemicals used this method in the early XX th century to gain control of foreign mar- 
kets; they maintained a high domestic price with the help of an import tariff and sold 
abroad at competitive price, sometimes at a loss. 

Another case are DVDs which were clearly priced in a differentiated way between the 
US and Europe during the 1990s. To avoid parallel imports from the cheap US market 
to Europe, 16 ® a technology had been devised and imbedded into DVD players to force a 
pre-determined match i.e., a DVD player bought in Europe would only read DVDs bought 
in Europe (cf. §12.3.4). Because differential pricing also greatly limited the availability 
of titles in Europe, sales of players were not taking off fast (cf. §24.2.2 on the issue of 
complementarity) so that retailers started to "hack" DVD players i.e., making them able 
to play any discs (among which the US ones). This form of arbitrage was so successful 
that it made its way back to the industrial makers of DVD players who "pre-hacked" the 
protection system at assembly stage. The machine would not violate the agreement with 
the DVD forum but the task of the retailer was considerably simplified since it was then 
enough to open the machine and turn a screw to deactivate the matching technology. 



4.2.3 Endogenous Segmentation 

If there is a natural segmentation, say between men and women, the firm can take 
advantage of it to price higher to the group displaying the highest average WTP but 
the ideal would be to ignore gender and divide the market into weak and strong WTP 
segments; the differences existing between the original segments would still be reflected 
into the new segments but in a manner that allows a greater surplus extraction. If, say, 
women spend twice more on clothing, their discriminatory price is greater than men's but 
ideally fashion victims are grouped together (may be one third male, two third female) 
in order to charge them much more. Another example would be air travel for which 
late buyers, business people, are more eager than early ones, tourists. If it is common 
knowledge that the price will rise with time, then tourist buy early to enjoy a discount 
while business people buy late when they learn their exact needs. In absolute terms, this 
is not direct discrimination given that a tourist can always book a business ticket but the 
versions are made so as to effectively separate the strong and weak segments and force 
them into the desired price categories. 

Vertical Segmentation 

Under endogenous discrimination, profit rise faster (wrt. exogenous discrimination) and 
welfare is enhanced. We prove this claim within the standard linear model. 

Demand q = a - bp is divided into a strong segment q\ = a - bp\ and a weak segment 
made of those who enjoy the discount q 2 = a- q\ - bp2 (obviously, type #1 are barred from 
enjoying the discount). With marginal cost c, profit is 

n = qiiPi -c) + qiipi -c) oc qi{a-bc- qi) + q 2 {a-bc- q\ -qz) 

thus the two FOCs are 2q\ = a-bc-q 2 and 2q 2 = a-bc-q\. Their solution is q d = q d = 2 ~y £ 
leading to larger sales than under uniform pricing since q u = ^ £ . Profits are n u = [a ~ 4 b b c] 
and n d = ^^ while welfare is W u = 3(fl ~^ c)2 and W d = 4(a ~ b foc)2 (recall that the maximum 
is (a ~2fr C) )• The limit of this process of segmentation by WTP is perfect discrimination 
which is conducive of full efficiency. The comparison with exogenous discrimination is 
quite simple. Assume, for instance, the existence of two segments with demands param- 
eters d\ = 2a 2 = 4f and b\ = b 2 = b/2, then p d > p d (segment #1 is stronger) and total profit 

i^. ,,-_ (2a/3-fcc/2) 2 {a/3-bcl2) 2 _-.ii, a 2 „. w d 
lb il ~ Abl2 * 4fc/2 - " t" 36b ^ u ■ 



Innovation 

Innovation or the choice of quality is another avenue through which the firm can modify 
the market perception of differential pricing. We already know that a firm with market 
power e.g., a monopolist, wishes to to discriminate between a strong and a weak seg- 
ment. In the standard linear model, we saw in §4.2.2 that total output does not change 
wrt. uniform pricing which means that the allocation is inefficient. 17@ Now, if we ac- 
count for the presence of quality and its selection by the firm (to maximize profits), then 
price discrimination may be welfare improving and even beneficial to both the weak and 
strong segments. The reason exposed by Ikeda and Toshimitsu (2010) is that price dif- 
ferentiation allows to capture a greater share of the strong segment WTP; the firm's 
marginal return to quality is thus enhanced and she is lead to choose a higher quality 
(wrt. uniform pricing). Since this is an efficiency enhancing change, it may compensate 
for the mis-allocation between strong and weak segments. 

To check upon this claim, consider the model of §3.3.1 for two segments with a\ = 
1 (strong segment) and a 2 = 6 e [0; 1] (weak segment). If discrimination is impossible, 
the two segments are pooled together and, thanks to the simplicity of the setting, the 
new demand is twice that of a segment with parameter a = -^ which is an average 
between 6 and 1. The optimal price is thus p = ^ which is also an average between 
the prices that are optimal in each segment, p d = y and pf - |. This, in turn, leads to 
distorted sales q 2 = j^ < qf = \ = q d < q\ = j^g. Lastly, profit is n = ^ <n d + n d = a+ ^ )s , 
the discriminatory payoff 18@ As shown in §4.2.2 and confirmed in this example, total 
output is the same under uniform and discriminatory pricing (it is unity) but welfare is 
lower under discrimination. 

This analysis is correct for a given quality but once we take into account the ability 
to adjust quality at the margin, we reach a different conclusion. Indeed, with a cost of 

2 

producing quality y, the optimal quality level is simply the multiplying factor in the 
profit i.e., ^ under uniform pricing and the larger ^p under discriminatory pricing. 

2 

Taking into account that the net profit in equilibrium is y, choosing a higher quality 
unmistakably leads to higher profits. We have thus shown that discrimination allows 
to earn more by investing more into quality. The strongest result of these authors is 
that price discrimination is also welfare enhancing i.e., the discriminatory firm chooses 
such a high quality that the welfare gains (for society) from enjoying higher quality 
outweigh the allocative losses. Under differential pricing, welfare is y in the strong 
segment and yp in the weak one, thus net welfare is W d = 3s{1 + 5] - 4^. Given the optimal 
quality s d = ^, final welfare is W = (1 |g ] . For the uniform case, net welfare consist of 
5 elements, 2 profits, 2 consumers surpluses and cost for quality which simplifies into 
W u = ffi+J^ s-y- As the optimal quality is s = j^, we derive W u = §J±±0 and it is a 



matter of algebra to check that W >W o (1 -6) 4 > 0. Lastly, it may be verified that both 
segments benefit from price discrimination. 

I For a product whose quality is endogenously chosen by a firm with market power, price 
discrimination is both privately and socially desirable. 

Time Segmentation 

Market segmentation opportunities are sometimes determined by external forces such 
as country borders (and thus different legal systems) or gender but in many instances, 
firms endogenously fine tune their segments using criteria such as age or time schedule 
to determine whether the client benefits from a rebate or not. Let us use the happy 
hour example with T being the time limit where prices return to normal. Afternoon is 
the weak demand segment (#2) whereas evening is the strong demand segment (#1). We 
further need to assume that demand at time t is d{t,p) (independent of demand before 
or after). 

When the monopoly moves the threshold T later, he loses (pi - c)d{T, p x ) over seg- 
ment #1 and gains {p 2 - c)d{T, p 2 ) over segment #2. The optimum is thus found when the 
opposing forces are equal. Since the demand increases by d{T,p 2 ) -d[T,p\] this change 
is always welfare improving. 19@ We may thus conclude that the monopoly restricts too 
much the segment with low WTP i.e., happy hours are too short and likewise rebate 
conditions in transportation are too stringent. 

4.2.4 Proviso 

In this part, we deal with various reasons why differential pricing may not be used or 
used unexpectedly. 

Positive Discrimination 

The law generally forbids discrimination based on human traits but tolerates exceptions 
that serve the government's objectives in which case we speak of positive discrimina- 
tion. 20@ Redistribution of income is one important motive for permitting discrimination. 
For instance, young and old people benefit from preferential tariffs for many services that 
are under public regulation (e.g., transport) because it is a mean for the government to 
redistribute income within the population. As an extension, private firms are authorized 
to discriminate young and old people, as long as, they get better conditions than regular 
clients (e.g., cheaper tickets in movies theaters). Another curious case of positive gender 
discrimination is the free entrance for women in night clubs; men welcome this practice 



since the extra price they pay is compensated by a greater feminine attendance which 
after all is the reason why they patronize those clubs. 

In the case of insurance, discrimination based on a variety of human traits is oc- 
casionally permitted on grounds of fairness because it enables to adjust the individual 
premium to the expected cost of the individual; this way no group is forced to subsidize 
other groups. Taxi fares are regulated but in recognition of the painfulness of night 
driving, they are allowed to be more expensive during the night. 

Arbitrage 

Arbitrage is a complex reaction to discrimination attempts and is inter- wined with infor- 
mation and legality. 

A first strategy to avoid discrimination is to disguise oneself or mask one's intentions 
by adopting the iron mask of poker players; for instance, in a flea market, the experi- 
enced seller is able guess your WTP from your cloths and attitude, thus you better dress 
casually and pretend to be vaguely interested if you want to make a good deal. For the 
same reason, some bidders in art auctions prefer to participate by phone rather than 
appearing in person in the auction hall. Secondly, consumers might try to defraud the 
scheme by pretending to be eligible for a better price i.e., your partner buys a software 
suite for home use and then lets you use it for your business or you buy a one-license 
software and install it on 20 computers. We won't pursue this possibility since economics 
always assume that laws are exactly enforced (unless we study this specific problem). 

For most services, arbitrage is easy to avoid because the contractual relationship is 
between the firm and a particular consumer, so that the latter cannot resale the service 
because he is not the owner of the service, he is just entitled to receive the service (e.g., 
plane ticket). However, for expensive items like a yearly subscription to opera, trans- 
ferability is often permitted to enable the customer to share the cost with friends and 
family. 

Arbitrage is a reality for goods for the simple reason that the transaction involves a 
transfer of ownership. The means to circumscribe arbitrage on goods are: 

• Trademarks: a legal limitation to resale studied in §12.3.3. 

• Warranties: offered exclusively to the direct buyer of the (durable) good; they de- 
crease the value of resold items. 

• Security Concerns: authorities limit the number of dealers who can lawfully sale 
and resale the item but oblige them to guarantee reliability and security of the 
product (cf. §9.1.2). 



One important practical case is the European car market; it used to be quite complex 
to buy a car in a European country and import it into another one because the vehicle 
had to pass security inspections. Unless the price difference was very high, it was not 
worthwhile to spend time and money to perform the arbitrage. With the advent of the 
internal market and the progressive implementation of the new regulation of 2002 for 
the car sector, these limitations have disappeared. Nowadays, there are intermediaries 
who buy new cars in countries where manufacturers prices are cheap in order to resale 
them in countries where manufacturers prices are dear. 

What this example highlights is the existence of a transaction cost associated to the 
resale operation. Prohibition amounts to an infinite transaction cost while in the case of 
perfect arbitrage the cost is zero. The transaction cost t is thus protecting the monopoly 
in its intent to price discriminate among categories of clients. Returning to our analytic 
model and considering two segments, if t > p 2 - pi, then the arbitrage poses no threat 
to the monopoly. Otherwise the monopoly cannot freely discriminate because arbitrage 
would drive the high price down to p\ + t; the optimum is then to set a higher bottom 
price p\ and a lower top price p 2 = p\ + t. 

Seasonality potentially enables firms to directly discriminate but arbitrage is also 
present although less active when it comes to services. In many retail sectors, sales 
on Saturday are notoriously larger than those of the remaining days of the week, thus 
firms ought to raise their price on Saturdays. One first obvious obstacle is the cost of 
changing price tags twice a week but another much more compelling is that many goods 
that can be cheaply stored (e.g., books using shelves or fresh food using a refrigerator) 
so that arbitrage is a meaningful threat at the disposal of consumers to force outlets 
to maintain constant prices. For a larger time span, arbitrage becomes expensive so 
that shops can take advantage of the demand peak; this is why prices for luxury items 
increase one month before Christmas and decrease after new-year's eve. 

Regarding services such as travel, consumers can arbitrate and wait the off-peak 
period to enjoy lower prices. Yet, they suffer an opportunity cost of switching their con- 
sumption to the off-peak period (e.g., they must free paid working time to go shopping 
during week days or free leisure time to go wash the car during the week-end), thus the 
firm can maintain a price difference. Even so, congestion will frequently appear during 
the peak period (capacity is exhausted) which means that the pricing of these services 
goes beyond simple revenue maximization and spills over cost savings and investment 
policy; this issue is tackled in §25.3 on peak-load pricing. 



Complex vs. Uniform pricing 

For most services, it is technically possible to record detailed consumption and charge 
the client according to it. Yet this option is not as frequently used as theory would 
predict, this because of several drawbacks. Firstly, it requires an extra fixed cost in the 
form of labour and capital investment to perform metering, billing and monitoring (avoid 
free riding and defrauding). Secondly, per-unit pricing, by disrupting service provision, 
tends to lower quality and thus reduce the global WTP for the service. Thirdly, the 
previous annoyance cannot be eliminated through ex-ante contracting because ex-post 
contingencies (i) force parties to fine tune the characteristics of the service in real-time 
and/or (ii) change the value of the service in real-time, making the determination of 
price difficult. In both cases, haggling and inefficient opportunistic behavior is bound to 
appear (cf §14.2 on hold-up). If these costs exceed the extra profit afforded to the firm 
by differential pricing then only access charging will be observed. 21@ 

Another compelling reason why many services only involve access or membership 
charging is because the optimal pricing scheme exhausts the gains of trade with the 
client and this amounts to price at marginal cost which is often zero. This claim is well 
known for the monopoly case (cf. §4.1.3) and since it does not depends on the existence 
of competition nor on its degree, it readily extends to other market structures. Each 
client is then characterized by his WTP for the global service (i.e., access and utility 
maximizing consumption) and it is on this dimension alone that firms compete. 

4.3 Indirect Differential Pricing 

After presenting some examples of indirect discrimination based on quantity rebates, 
we proceed to study the design of an optimal scheme of indirect discrimination in two 
steps. Considering a market made of a weak and a strong segments, we first look at a 
"no-discrimination" case i.e., the firm offers a unique tariff to his whole clientele. In a 
second step, we show how the addition of a second tariff can improve profits by leading 
heterogeneous consumers to sort themselves out. Whether the menu of tariffs is offered 
depends then on transaction cost i.e., whether it is not too costly to create, manage and 
control its correct implementation, including the backlash of angry consumers who have 
a tendency to hate blatant discrimination. 

As for the pricing tools used by the firm, recall that in a competitive market the 
consumer is faced with the linear (aka uniform) tariff T{q) = pq where p is set by the 
market while in a monopolized market, the firm has the ability to go beyond uniform 
pricing and offer any potentially complex quantity-price relationship (aka non linear 



pricing). The latter will often include a subscription, something entirely absent from a 
competitive market for "free entry" drives it to zero. 



4.3.1 Quantity Rebates 

Quantity rebates are present everywhere around us; for instance, the price per kilo of 
many food items diminishes with the size of the container (much more than what the 
savings on the container would permit) as the example of Table 4.7 shows (prices quoted 
in November 2005 for exactly the same wine). Some people then buy the compact size, 
others prefer the regular size and the remaining prefer the family size; in the end ev- 
eryone has picked the version best suited to him/her in terms of size and price (cf also 
Table 4.1). 



Bottle size (liter) 


1/5 


3/8 


1/2 


3/4 


Price (€) 


1,5 


2,25 


2,35 


2,65 


Price / liter 


7,5 


6 


4,7 


3,5 


Mg. Price 


7,5 


4,3 


0,8 


1,2 



Table 4.7: Retail Prices for Wine 

Another popular example of quantity rebate is the "buy-one-get-one-free" (BOGO) 
promotion at your local pizzeria for take away (instead of home delivery). Suppose you'd 
pay up to 14€ for a first pizza but only 7€ for a second one, then the BOGO package at 
18€ will appeal to you. Selling pizzas at the same average price of 9€/piece would trigger 
only one sale, thereby halving the profit (assuming negligible production cost). 22@ 

The BOGO also acts an indirect discrimination scheme because high WTP clients who 
bother walking to the pizzeria pay the regular price of 18€/piece to get home delivery; 
consumers sort themselves out in two distinct segments paying different prices for the 
same basic good, pizza (cf. §4.3.1). Lastly, the BOGO acts an intertemporal discrimina- 
tion scheme because it is only active from Monday till Thursday when business is low 
(off-peak period). During the week-end when customers display a higher WTP (this de- 
fines the peak period), the restaurant has the opportunity to sell pizzas at the regular 
price of 18€/piece without generating any hard feeling (cf. §25.3 on peak-load pricing). 

Mobile phone price plans like many services use two-part tariffs and can be designed 
to incorporate quantity rebates i.e., the average price is decreasing with the contracted 
monthly number of minutes. Table 4.8 show some data from a carrier of a large European 
country. Thanks to the clever design of these tariffs, an indirect discrimination occurs 
because each person picks the proposal best suited for him/her and ends up paying and 
consuming varied amounts. Clients have sorted themselves out for the benefit of the 
firm. 



Monthly Fee (€) 


18 


28 


36 


43 


61 


77 


90 


126 


Allowance (hours) 


1 


2 


3 


4 


6 


8 


10 


15 


Av. Price (€ct./mi.) 


30 


23 


20 


18 


17 


16 


15 


14 


Mg. Price (€ct./mi.) 


30 


17 


13 


12 


15 


13 


11 


12 



Table 4.8: Mobile phone price plans 

The tariffs presented in Table 4.8 form a menu, the design of which is extremely 
complex. A first step to understand the characteristics of the optimal menu is to look for 
the optimal single tariff. 

4.3.2 Optimal Single Tariff 

In this paragraph, the firm voluntarily limits herself to propose a unique tariff to both 
segments, for instance because direct discrimination is either prohibited or too costly to 
implement. The firm must arbitrate between market coverage and mark-up. Indeed, 
she may cater to all segments with a low subscription fee, an inclusive strategy, or tar- 
get the strong segment to extract more consumer surplus from it with a high fee, an 
exclusive strategy. Obviously the optimal strategy depends on the respective sizes of the 
two segments. Lastly, a "middle of the road" solution that mixes the two previous ap- 
proaches (and improve upon them) is the three-part tariff consisting in a subscription, a 
free allotment of units and a per-unit price for additional units. 

In the more general case where the market is populated by a continuum of heteroge- 
nous clients, ranked by WTP from strongest to weakest, the optimal strategy is to set 
an exclusion level through the access fee and then apply an aftermarket mark-up (i.e., 
price above marginal cost) in order to extract surplus from strong clients. One then 
speaks of price discrimination by "metering" since clients are sorted out by how much 
they consume in the aftermarket (cf Blackstone (1975) for an historical case). 



Inclusive Tariff 

Oi (1971) derives the optimal inclusive (in) two-part tariff when the monopoly faces 
strong and weak segments such as poor and rich households or home and professional 
users but is not able to discriminate directly among them. With the help of the left panel 
on Figure 4.3, we shall demonstrate that pricing at marginal cost is not optimal anymore 
once there is some heterogeneity in the demand (in contrast with the individual surplus 
extraction characterized in §4.1.3). 

The demand functions of a weak and strong consumer satisfy D x < D 2 . Starting with 
pricing at marginal cost c, the demands are ql and q£ generating maximum consumer 



surpluses W* x and W* 2 . To attract both consumers, the monopoly can set the common 
subscription at F = W* x < W* 2 . Consider now raising the unit price to p > c; this gener- 
ates a sales profit from consumer #1 measured by area G on Figure 4.3 but also a slightly 
bigger reduction G + a of his surplus (a, like j6, is a deadweight loss). In order to keep 
this client, the monopoly must reduce his subscription by exactly the same amount i.e., 
AF = -G-a. The operation therefore involves a small loss, area A, over consumer #1. 
Regarding consumer #2, the subscription loss is G + a while the gain from additional 
profitable sales is G + a + H as seen on the left panel of Figure 4.3; overall, there a benefit 
H from raising the price above marginal cost (assuming equal size populations for both 
types). 



non claimable surplus 



claimable surplus 




qi qi q 2 q 2 A= qi q3 q 2 

two part tariff three part tariff 

Figure 4.3: Two and Three Part Tariffs 



To derive an exact formula, we use the individual demands D\{p) = a\ - b\p, D 2 {p) = 
a-2 - bzp and denote p e]0;1[ the proportion of people with low WTP in the overall pop- 
ulation. We use " to denote the average of a variable or parameter, hence demand is 
D{p) = a- bp. Within an inclusive strategy, the monopoly optimally sets F = W^iCp) to 
attract both types of clients, hence profit is U in {p) = W dt i(p) + {p- c)D(p). 



ip-c)D'(p) + D{p) -Di{p), we have 



an' 



dp 



p=c 



Since %- = 
> given that average demand exceeds that 



of the weakest segment. This is the analytical equivalent to the graphical demonstra- 
tion given above that the inclusive monopolist inefficiently price above marginal cost to 
take advantage of the strong segment (cf also generalization below). The optimal policy 
solves = ^ o{p-c)b = D{p) - Diip) => p in = a+ l c , ai ■ It is a matter of exercise to check 
that the basic monopoly price corresponding to free subscription and consumption by 



both groups is p M = &p > p in } z@ 

Exclusionary Tariff 

The exclusionary tariff (ex) that deliberately excludes the weak segment (#1) from con- 
suming is found by setting F ex = Wd, 2 {p) and p ex = c to extract the whole consumer surplus 
of people within the strong segment (#2). Comparing the two policies, inclusive and exclu- 
sive is quite simple since the profits can be decomposed as follows: U m = aU[ n + (1 - a)U l 2 n 
and U ex = aY\.\ x + (1 - a)U 2 x with nf* = given that segment 1 is excluded but U 2 X > U l 2 n 
since the profit made on segment 2 is maximized. It is therefore obvious that keeping all 
consumers as clients is worthwhile only if their proportion in the population a is large 
enough. 

In real situations, there are many categories of clients so that firms rationally ex- 
clude the classes with a low valuation for the service (often poor households) in order 
to extract surplus from more profitable clients. Typically, large industrial buyers are 
favored by such pricing policies. This is why quantity rebates are sometimes forbidden 
by authorities to protect small buyers. The same rule applies for monopsonies, as in the 
case of the supermarket chains who devise such two-part tariffs to extract the surplus of 
food producers. 

Three Part Tariff 

Oi (1971) further rationalizes the use of three part tariffs consisting of an upfront pay- 
ment (transfer) t, a free allotment of time q and a price p for additional units. These 
schemes were popular in the lease of mainframe computing power in the 1960s (cf Wald- 
man (1997) and §24.1) and are now widely used by mobile phone carriers. The data on 
mobile phone plans given in Table 4.8 also include a price p = 0.45 € for extra minutes, 
which is far above the maximum average price tl q of any plan. 

We plot on the right panel of Figure 4.3 the two individual demands we previously 
considered; imagine that the optimal inclusive two-part tariff features the price p leading 
to purchases q\ and q 2 by the two types of consumers. The fee is F = W^iip) shown by 
the grid area under D\. The monopoly can still improve on this scheme by offering his 
clients a three-part tariff where the allotment is A = q± and the fee is t = F+Ap. Whatever 
the choice of the unit price p > p for extra units (above A), consumer #1 is indifferent 
among the two plans because he won't consume extra units with the three-part tariff. 
By construction of this three-part tariff, the monopoly makes the same profit over this 
client. If consumer #2 agrees to the tariff change, he will consume the allotment and 
express a residual demand shown by the dotted line parallel to D 2 on the right panel of 



Figure 4.3. This means that some of his net consumer surplus under the two-part tariff 
can be reclaimed through standard monopoly pricing: we equate the residual marginal 
revenue with marginal cost to obtain an optimal quantity q 3 < q 2 and the corresponding 
price p. As a consequence, client #2 reduces his consumption and the monopoly succeeds 
to increase profits by recapturing part of consumer #2's surplus (the stripped area). 24@ 

Generalization 

If, following Littlechild (1975), we allow for more classes of consumers (in- stead of two), 
it is easiest to consider a continuum indexed by a characteristic 9 e [0;1] running from 
most to least interested by the item for sale i.e., the WTP P$(q) of a consumer with 
characteristic 6 satisfies -^ < 0. When faced with the unit price p, user 6 demands 
Dg{p) solving Pg{q) = p and derives consumer surplus W Di g{p). It is easy to check that 
both functions are decreasing with the characteristic 8 (cf §2.16). Hence, if we let p be 
the marginal user for which W^^ip) = f, then the firm's clients are all the consumers 
with a characteristic < p so that choosing / is equivalent to choosing p. This setting 
allows to introduce the fixed cost y of serving a customer. The per-capita profit with 
type < p is ng{p,p) = W D} ^{p) + {p - c)Dg{p) - y whereas welfare is the larger Wg{p) = 
w D,e(p) + {p- c)D g {p) -y. Letting H denote the law of population distribution among 
characteristics and D^{p) = -j^-j Jq Dg{p) dH{6) = E[Dg\0 < p] being the average demand of 
consumers with characteristics lesser or equal to p, profit reads 

n(p,p)=[ n e (p,p)dHm = {W D ^p) + (p-c)D^p)-y)H(p) (4.3) 

Jo 

while welfare is 

w(p, P )= [ w 6 {p)dme) (4.4) 

Jo 

Since the consumer surplus satisfies — j^- = -Dg{p) (cf. §2.16), the FOC on price for 
(4.4) characterizing an efficient allocation is = ^ = fjf ^ dH(6) = (p - c) ff D g (p)dH(6)^> 
p = c, as expected. The situation is different for the monopolist since the FOC on price 
for (4.3) is D^ip) -D^ip) = (p - c)D^{p). Introducing the (average) demand elasticity e = 



Duip) 



, we get 



p-c - - p-c 1 1 DJp) \ 

£ eD^(p) = D^ip) - D^ip) oS£ = ^ = - 1 - -J^— (4.5) 

i.e., the standard Lerner formula (cf. eq. 3.4) except for the presence of the ratio of 
marginal to average demand which tends to weakens the firm's market power. 



If consumers are identical then the margin and the average are identical so that 5£ is 
nil which means that profit arises from the entrance fee since the price is optimally set 
at the marginal cost. If instead consumers are heterogeneous and are sorted by WTP, the 
marginal one demands less than the average one so that price is inefficiently (socially 
speaking) greater than marginal cost, yet set optimally (from the point of view of the 
private firm). The model can be extended to any type of heterogeneity and yield the 
same formula, only that the relationship between average and margin can go either way. 

Overall, we may conclude that the monopoly excludes too many people and charge 
too much to its clients. Indeed, since Wq-tiq = W Dr g{p) - Wo^ip) > and the monopoly 
optimum satisfies g = = g, we have ^ = ^ ex D^(p)-D^p) < and ^ = -^ff(/x) > 
0. 

Lastly, we characterize the privately and socially optimal clientele size wrt. the fun- 
damentals of the market. The FOC for (4.4) characterizing an efficient allocation is 
= llu ^ w d,h(c) = T i- e -> the market is thus efficiently served up to the person whose 
maximum consumer surplus is equal to the cost of service. The size FOC for (4.3) is 
expressed in terms of the access fee / = W Di ^ and is 

n^p, p)h(p) = ~^-H{p) of-J= ^-±^ -{p- c)D^(p) (4.6) 

op h{p) p 

which is negative if the profit to be made in the aftermarket is large and average elas- 
ticity of demand to income e^ is low 25@ This happens if there is a large heterogene- 
ity among clients regarding service use but little regarding subscription i.e., everybody 
wants to enjoy the good but at varying levels of intensity. If so, the monopolist offers a 
"bargain-then-rip-off ' scheme as we have / < y and p » c. 

4.3.3 Optimal Quantity Discrimination aka Non Linear Pricing 

The issues dealt with in this section belong to the more general theory of screening 
developed in §21.2. 

Self-Selection 

As we saw with mobile phone price plans, firms devise not one but many complex tariffs 
to cater to the heterogeneity of their clients. Their idea is to propose tariffs called green 
and red to both professional and home users. If everybody picks the green tariff, it is 
better to throw out the red one rather than annoying people with it; in that case, we say 
that the firm pools clients because nothing distinguish one type from the other. If, on 
the other hand, professionals choose the red tariff while home users pick the green one, 



then we might as well rename the tariffs "pro" and "home"; in that case we say that the 
firm separates clients as they identify themselves when picking an offer among the two 
proposals. 26 ® 

The optimal policy of the firm involves tree steps; firstly design the best pooling tariff 
which was done in the previous part; secondly design the best pair of separating tariffs 
which shall be done right away; lastly compare the two strategies and select the optimal 
one in terms of final profits. This is a complex task which we will only touch; an early 
reference is Maskin and Riley (1984) while the classical treatment is Wilson (1993). 

When designing the "pro" and "home" tariffs, the firm must always take care that pro- 
fessionals prefer the "pro" over the "home" proposal while home users hold the reverse 
preference. We now see clearly that indirect discrimination is more stringent than direct 
discrimination because under the latter framework, the firm can force professionals to 
buy the "pro" tariff (or not consume) and force home users to buy the "home" tariff (or not 
consume). The firm has therefore more freedom to design tariffs under direct discrimina- 
tion. We shall see that indirect discrimination compels the firm to leave an information 
rent to customers with a high WTP. 

Graphical Analysis 

The individual demands are the low one D/ from home users and the larger one D^ 
from professionals. The optimal discriminating two-part tariffs is to charge units at 
marginal cost c and ask for a subscription F* = W Dii {q*) equal to the consumer surplus 
corresponding to the efficient demand q* = Dj(c) of group i = l,h. This pair of tariffs is 
illustrated on the left panel of Figure 4.4 where F* is the area of the small triangle while 
F* = F* + A* h + B* h . An equivalent method is to offer the optimal discriminating bundles 
(q* , t* ) for i = I, h with t* =q*c + F* . 

If the monopoly cannot force consumer i to buy the two-part tariff [c, F*) but is obliged 
to propose both tariffs to both consumers then all will go for {c,F*) since the unit price 
is the same while the subscription is cheaper. 27 ® The optimal discriminating bundles 
also generates the same ordering i.e., everybody prefers (</ z *,tp over {q^,t^) so that the 
monopoly earns F* with both consumers. Yet bundles and two-part tariffs are different 
from the point of view of a type h consumer; indeed, when picking {c,F*) over (c,F*), 
he derives a net surplus of A* + 5* since he consumes q* h units. Now, when picking 
bundle (<7j*,f*) over {q* h ,t^), consumer h derives the smaller net surplus A* because his 
consumption has been reduced. 

This observation indicates a first improvement for the monopoly: offer bundles {q*,t*) 
and {q* h ,th) with tu = t* h - A* h to convince consumer h to accept the second one; indeed, it 
leaves him a net surplus (also called a rent) of exactly A*. The profit over consumer 



h therefore pass from FT to F* - A* = FT +B* i.e., part of the lost surplus has been re- 
claimed (cf left panel of Figure 4.4). 28@ To reclaim further surplus from consumer h, it 
is necessary to scrutinize the optimal quantities. 
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Figure 4.4: Self-Selection among Tariffs 



A second improvement is shown in the right panel of Figure 4.4 where qi is reduced 
below the efficient level q* ; this obviously reduces the surplus of a type / consumer so 
that the transfer t\ must accordingly be reduced to maintain him indifferent and keep 
him as a client (recall that he derives zero net surplus from the bundle). Overall there 
is downfall in the profit made over consumer / because a deadweight loss (grey triangle) 
has been created and the monopoly must assume it. Yet the drop of profits is small 
because the distortion we are imposing starts from the efficient level. The great benefit 
of this distortion is to strongly reduce the attractiveness of [qi, ti) for consumer h. His 
"non claimable surplus" has been cut so that his transfer th can be raised. The stripped 
area represents the final net gain. 

In this process, the quantity q^ is kept at the efficient level q* h since it generates a 
surplus B* h that can be siphoned by the transfer. The limit to this process of reducing 
qi is achieved when the loss made over consumer / cancels out with the gain made over 
consumer h i.e., when the left side of the grey triangle has the same height as the left side 
of the stripped area. 29@ Hence, area A; r will never be reduced to zero and this means that 
consumer h enjoys a positive net surplus. We call it an information rent to emphasize 
that it comes from the impossibility for the monopoly to identify him and fully extract 
his consumer surplus; somehow, he is able to hid behind the low WTP consumer (/). 

More generally, the limit to the distortion process illustrated on Figure 4.4 depends 
on the proportion of each group in the total population since in our graphical presenta- 
tion we were considering a one-to-one situation (50% home users vs. 50% professionals). 



The same analytical problem will be treated with more rigor in §21.2. The general con- 
clusions that emerge from that theory are: 

• For all buyers excepts those with the highest WTP, the quantity consumed is 
inefficiently low. 

• Tariff design is limited by the fact that a hight WTP consumer tries to grab the 
package set for the person immediately below him in terms of WTP. 

• The buyer with lowest WTP derives no net surplus (indifferent with not consum- 
ing) 



4.3.4 Discrimination by Differentiation (Versioning) 

This last section presents shortly the remaining forms of indirect discrimination that are 
based on the heterogeneity of tastes regarding quality or attributes of items and on the 
moment at which they are consumed. 

An early case of indirect discrimination by quality differentiation is seating in the- 
aters that has been in use over centuries. The cheapest rate allows one to stand to see 
the act; at a higher price, one can seat on a bench in a "first-come-first-seated" fashion; 
by paying a premium, one gets a reserved chair and finally, the wealthiest people hold a 
box all year long. 

Regarding network industries, the oldest example is probably the 3 passenger classes 
introduced by all railways companies during the XIX th century. At the time, the third 
class was treating passengers very harshly and many advocated that the railways com- 
panies should spend the small cost of improving their comfort. Dupuit (1849), comment- 
ing these schemes, answers them with an argument of self-selection: if the wood benches 
in the third class were covered with a cheap cloth, many second class users would switch 
to the cheaper third class given that the price difference would cease to be worth the 
quality difference. If this were to happen, the company would need to compensate sec- 
ond class users, for instance by introducing a moving service inside the second class cars. 
But then the first class users would feel unjustified the premium they pay. 

To implement the class system, it is necessary to maintain a large quality difference 
between any two adjacent classes. For trains, the third class disappeared in developed 
countries when everybody became able to afford second class while nothing justified rais- 
ing the first class rate to the sky. Planes offering very long trip are still able to maintain 
3 classes thanks to the "Sleeper-Seat" innovation that justify the premium between the 
first and business classes; the reader has probably noticed that tourist class in these 



flights is no more fun than the wood benches in the third class railways of the past, but 
this is a necessary evil if we want to be offered cheap tickets. 

Just like third class in railways was a downgraded second class, many firms today 
deliberately impair a good or service to create a lower quality version that can be sold 
at a discount price; these products are called damaged or hobbled goods. For instance, 
many commercial softwares have a light version with restricted functionalities or some 
media products are made artificially smaller (60 mn disc vs. 74 mn). In other cases, 
printers, computer chips or postal delivery services are artificially slowed. Finally, the 
introduction of computerized systems in automobiles has lead automakers to design mo- 
tors whose horsepower can be electronically reduced so as to offer a lighter but cheaper 
version of the car (which in several countries also qualify for lower tax). Likewise the 
onboard computer is always present but the display functions are empowered at assem- 
bly only if the client has paid for the options. A recent popular case is Apple's 33% price 
cut of the iPhone in 2008 on the day the iPod Touch was introduced. This apparatus 
is similar except for the absence of telephony service. This solved the cannibalization 
problem whereby consumers would abandon the pricey iPhone for the less expensive but 
almost as useful iPod Touch. 

Up to now, we have discussed a vertical differentiation based on quality; the other 
common form of differentiation is horizontal and uses the heterogeneity of consumer 
tastes, for instance that related to gender (men vs. women) or age (young vs. mature). 
Both types of differentiation are pursued with more detail in §11. 

4.3.5 Inter-temporal Discrimination 

Dupuit (1844)'s original example of differential pricing remains topical to introduce the 
time dimension: the wise editor of a novel can print a limited number of luxurious copies 
for the first edition and sell them at 40€ each; the second edition will come out immedi- 
ately for the general public and will be priced at 20€. A few months later, the publisher 
will issue a third edition by subscription at the reduced rate of 10€. The next year, a 
fourth illustrated edition will come out at only 5€, soon followed by a popular edition on 
cheap paper at 3€. Finally, a compact small print edition will be issued for 2€. All in all, 
the publisher will sell many more copies and earn much more than if he were to set a 
single monopoly price, for instance 20€. This scheme was not well received at the time 
and even called a scam. Dupuit (1844) countered that, thanks to the greater profits of 
the editor, the writer can be paid a decent wage and that many people have been able 
to enjoy his book so that everyone is better off; in other words the pricing scheme is a 
Pareto improvement upon traditional pricing. 



As opposed to flow goods like a beer or a train ticket whose consumption is synony- 
mous of destruction, a durable good lasts for at least two periods. Thus, once the mono- 
poly has sold to some consumers he cannot sell again to them later on. Our purpose is 
to see how this limitation can be overcome. We shall consider in turn several models of 
increasing complexity regarding the behavior of both the monopoly and the consumers. 
We say that customers are sophisticated if they anticipate any future price cut and can 
therefore postpone their purchase (if they wish). Inversely, customers are naive or my- 
opic if they have no memory or fail to anticipate changes in the pricing policy 30 ® We also 
distinguish whether the firm can commit or not to a pricing policy over the two periods. 
If not, she is bound to set a price optimal given the current condition i.e., the game is 
solved by backward induction. The more general interaction of commitment and price 
discrimination is the object of §4.3.6. 

Pricing for Myopic Consumers 

Intuitvely, the monopoly can sell the good to rich consumers (more precisely those with 
a high WTP) in the first period and reduce the price in the second period to reach poorer 
consumers. For instance, he could initially charge the monopoly price p^ corresponding 
to a once-for-all sale, then estimate the residual demand D for the second period and 
charge the updated monopoly price p^ (thereby fooling the memoryless consumers). The 
monopoly therefore earns n^ and a lower U^. 

Concretely we assume additive temporal preferences. The demand for one period use 
of the good is D{p) = 1 - p and the extra value of an additional period ownership is 5 
times the single period value where 5 < 1 is the obsolescence factor of the good common 
to all consumers. The market WTP in the first period for a once-for-all sale is Pi {q\) = 
{l + 6){l-qi). With constant marginal cost (normalized to zero), the monopoly sells q^ = \ 
at price pf 4 = ^ to earn nf 4 = i±£. The residual demand is D(p 2 ) = 1 - q™ - pz because 
the rich consumers have bought first (the sale is like an auctioning process); the market 
WTP is now Pziqz) -\- qz, hence the optimal second period sales are q!f = \ at price 
P 2 M =\<Pi yielding n% = ±. Total profit is thus 5±|*. 

The truly optimal strategy when facing myopic consumers is slightly different al- 
though it obeys the same intuition. Given initial sales c\\ (not necessarily half of the 
market), there is a residual demand, thus an optimal second period price p 2 . We then 
compute the reduced form second period profit n 2 (^i) and maximize the total profit 
n^!) + n 2 (<7i). Recalling that profits are always increasing with market size, n 2 (<7i) must 
be a decreasing function since more sales today means that tomorrow's market will be 
smaller. Hence, the marginal profit of the second period is negative, it acts as a marginal 
cost within the total marginal profit. Now, being faced with a higher overall marginal 



cost, the monopoly will optimally reduce its first period sales; therefore the optimal ini- 
tial price pi is greater than the monopoly price p^ 1 but the rebate for the second period 
is also greater. 

In our example, if q\ units are sold initially, the residual demand is D{p 2 ) -\-q\-p 2 , 
thus optimal sales are q 2 {q\) = |(l - <7i) at price p 2 {q\) = |(l - <?i) leading to second period 
profit ft 2 (<7i) = \[l- cji) . Recalling that the first period profit being (1 + <5)(1 - q\)q\, total 
profit is 

n = qiPiiqi) + fl 2 {qi) = (1 + 5) (1 - q x }q x + J (l - <n) 2 (4.7) 

The optimal initial sales, maximizing (4.7), are thus q\ = j^s < ^1* a ^ the P r i ce price 
Pi = P\{qi) - { \^28 > ^lf ~ P\- Since the monopoly restrict first period sales, it increases 
the second period ones as q 2 {q\) = 2 (\ + +28) > °$l- ^ s we previously claimed, the new rebate 
P\-p2 = -^ is greater than the old one p^ - p^ 1 = ^f^-. 

Summarizing, if consumers are myopic and don't consider the future when taking 
today's purchasing decision, the monopoly can basically sell the good to rich consumers 
in the first period and reduce the price in the second period to reach poorer consumers. 
In fact, the optimal strategy when facing myopic consumers is different because the two 
markets, today and tomorrow, are related. Recall indeed that periodic profits are in- 
creasing with market size and that more sales today means a smaller market tomorrow. 
Hence, increasing sales marginally today generates a marginal loss on tomorrow's prof- 
its. The monopoly has therefore an incentive to reduce today's sales with respect to the 
standard monopoly computation. He then offers a larger rebate in the second period to 
capture a large market share. 

Leasing 

Lowering prices in the second period means lowering benefits, hence a clever attitude is 
to turn the durable good into a non-durable one by leasing (renting) it in every period. 
This way, rich consumers are forced to pay twice the good while poor are excluded from 
consumption. Since the market WTP for the first period of use is 1 - q, the optimal first 
period volume is \ at the lease price \. In the second period, the good loses values thus 
the WTP is 6{l -q) so that the optimal second period volume is \ at the lease price |; total 
profit is thus ^. The key to this result is that by leasing the good, the monopolist retains 
ownership of all produced units and thus is not tempted to expropriate the owners of 
previous production by flooding the market because he would expropriate himself. 

Since the full monopoly outcome is trade restrictive and quite inequitable, it has been 
ruled out by the US antitrust authorities in several cases against Xerox in the 1960s 
and IBM in the 1970s (cf Waldman (1997)). More recently, the 1994 Consent Decree has 



prohibited Microsoft from offering long-term contracts (contracts with duration longer 
than one year) for operating systems. A strategy similar to leasing is the repurchase 
agreement according to which, the monopoly accepts to repurchase all units previously 
sold if he comes to offer a lower price, thereby permitting old customers to buy again at 
the new and lower price. Almost identical is the most favored customer (MFC) clause 
which compensates old customers for any price rebate offered to new customers; the 
only difference is the continuity of ownership of the units previously sold. These two 
contractual arrangements give the same commitment power to the monopoly to maintain 
its price constant over time; they are often viewed as anti-competitive by authorities as 
we explain in §9.2.1. 

Pricing to Sophisticated Consumers 

We abandon the option to lease the good and come back to sale as ownership transfer. 
The myopic attitude is hard to sustain as consumers do observe the price cut. Some 
first period buyers may simply anticipate it and decide to wait in order to enjoy the good, 
later but cheaper. This anticipation forces the monopoly to reconsider its pricing strategy 
and adopt a dynamic perspective consistent with its clients expectations. The solution 
is found by applying backward induction: given what happened in the first period (be it 
coherent or not) how will the consumers and the monopoly act in the last period? 

Although consumers observe the price p\, they anticipate sales q\ < D{p\) because 
some people will delay their acquisition. Consumers are rational and know that the 
monopoly is rational too, hence the initial sale of q\ units will be followed by the optimal 
price p2{cji) = |(1 - cji)- To advance further, we need to delve into the roots of consumer 
demand. When initial sales are cj\, the margin buyer has per-period WTP l-q\, hence his 
utility for early purchase is (1 + 5X1 - q\) - p\ whereas he nets \-q\-p2 for late purchase. 
This is truly the marginal buyer if he is indifferent i.e., if p\ = pi(qi) = (| + 8) (1 - q\), the 
unique price consistent with consumers' expectations. Notice that p\ < P\ i.e, consumers 
by anticipating the future price cut, force the monopoly to distort the initial price down- 
ward. The total profit that the monopoly can now anticipate is 

n M (<?i) = qipilqi) + ft 2 (<7i) = {\ + 6) fc(l -q~i) + \{\- q x f (4.8) 

The optimal initial sales, maximizing (4.8), are q\ = ||^y > -^s = <7i i- e -> sophistication 
forces greater initial sales (using a lower price) because clients anticipate the future 
rebate; the rebate is also lowered since p\-p 2 = ^(1 -</*)< (| + 5) (1 - q\) = p\- p2- Total 
Profit is n* = ggg. 



Commitment 

If the firm has the ability to commit to an inter-temporal price strategy, she can avoids 
some of the loss due to consumers' sophistication by selling once for all the good to half 
of the population and earn i±£ > n* although this is less than the profit derived against 
myopic consumers. Commitment does not come cheaply, it must be build. An example of 
a successful reputation is Disney's strategy of time limited releases; every year a movie 
is released in video (now DVD) for the consumer market during 6 months with a publicly 
made promise to stop selling it for 5 full years. Since kids' interest for the feature is 
bound to disappear after that lapse of time, parents have no choice but to buy early at 
the high price set by Disney. This profitable one-shot strategy works if the seller keeps 
its word. Now, as Disney's catalogue of candidate for releases is large, it has a strong 
incentive to maintain this reputation. Similar issues of dynamic consistency are studied 
in §10.3 on foreclosure. 

Coase conjecture 

The above logic of falling prices and patiently waiting consumers leads Coase (1972) to 
conjecture that 

I If the monopolist can sell any amount and cut price quickly, her initial offer would be 
approximately marginal cost, thereby replicating the competitive outcome. 

The monopolist's problem is her inability to commit to maintain a high price as she 
competes with price-cutting future incarnations of herself. The mathematical proof of 
this result is highly sensitive to details and in particular to the service capacity which 
cannot be quickly adjusted upwards or downward. Mcafee and Wiseman (2008) show 
that under such a plausible capacity constraint, profit cannot be lesser than 30% of the 
rental monopoly profit (as shown in appendix). 

4.3.6 Behavior-Based Discrimination 

We leave durable goods and return to flow goods that are repetitively bought. 

Purchase History 

Many industries, including supermarkets, airlines, and credit cards compile individual 
consumer transactions, study the purchasing behavior and make tailored offers to in- 
dividual consumers, via direct (e)mail or other forms of targeted marketing. Over the 



internet, cookies, static IP addresses, credit card numbers, and direct user authentica- 
tion make this strategy ubiquitous. Historic discrimination is thus the conditioning of 
the current price on the purchase history. 

As with all intents to discriminate, firms have to deal with the reaction of consumers 
who can hide the fact that they bought previously or try to simply escape recognition. 
The reason why buyers reveal their private information to sellers is to allow them to 
design a better service which in turn makes a premium price justified. Typically, a small 
piece of information regarding preferences enables to design at low cost an addition, a 
customization to the baseline service which is of great value to the customer. For in- 
stance, we stick with the same doctors, lawyers, accountants, dentists, butchers because 
these professionals know our tastes and cater to them. 31@ Expedited checkout, online or 
in a shop, is available to loyal fully registered clients but comes at the cost of forfeiting 
"new-client" discounts. Similar customized services are recommendations, reminders, 
preferences. There are also purely monetary rewards to loyalty with coupons and fidelity 
discounts (airline miles). 

Model 

Armstrong (2007) considers the periodic demand is D{p) = 1 - p (individual values are 
uniformly distributed in [0;1]) repeated over two periods. The monopolist sets price p\ 
for the first period and prices P2 and p 2 for the second period according to whether the 
sale is new or repeated. Inter-temporal discrimination occurs if p 2 ^ p\ while historic 
discrimination occurs when p 2 ^ p 2 . 

Commitment Let us analyze first the case where the firm can commit in advance to 
its pricing policy. When facing naive customers, the firm optimally sets p 2 -\p\ for 
new buyers (this is the standard monopoly price over the low WTP segment) and p 2 = 
Pi for repeaters (the monopoly wants to sell to all people with strong WTP). We have 
second period profit n 2 = pi(l - pi) + \p\ [p\ - \p\) which is maximum at p\ = |. Yet, the 
real objective sums past and present profits. As n\ = p\{\ - p\), we have n = n\ + n 2 = 
\pi (8-7pi) which is maximum at p\ - | > p M and leads to profits n = |. 32@ Consumer 
surplus is |(1 - p\) 2 for the first period and ~ p ~P2) +pupi-P2) j n ^q second one (taking 
into account the dichotomy between first time and repeating buyers), overall consumers 
net Wd = Hj = 0.15 at the firm's optimum. 

When facing sophisticated customers, the optimum is to repeat the one-shot monopoly 
price p M = \ and earn n = \. Indeed, the arbitrage v - p x = v - p 2 for the indifferent buyer 
still holds, thus p\ = p 2 so that no time arbitrage occurs. Since p 2 is chosen at will, it is 



optimal to set p 2 - p\ for repeaters (as before). It is now clear that the common monopoly 
price is optimal. Consumer surplus is then W D = 0.25. 

Concluding, under commitment, the firm takes advantage of consumers' naivety to 
increase its profits. Since prices are higher welfare decreases which means that con- 
sumer surplus is drastically reduced (from 0.25 to 0.15). In a free market economy, a 
firm sets price as she wishes in any period, thus inter-temporal discrimination is always 
possible (though not useful if consumers are sophisticated). Now, historic discrimination 
may be unavailable for either legal or informative reasons in which case the firm stands 
to lose. 33@ 

Commitment Failure We now move to the case where the firm cannot commit to its 
pricing policy i.e., cannot help but offer potentially different prices to the two groups of 
early and late buyers, if she finds it at her advantage to do so. The failure to commit is 
inconsequential when facing naive customers since they fail to perceive commitments, 
hence the firm takes advantage of them exactly as when she can commit. 

Against sophisticated customers, the last early buyer is v > p\ because people with 
WTP close to p\ prefer to wait for the ineluctable price cut. The optimal second period 
prices are as in the Naive-Commit case, p2 = \v and p 2 = v. Since the sophisticated cut- 
off buyer is indifferent between early and late purchase we have v - p\ = v - p 2 . Thus, 
Pi = Pi = \pi, a strategy quite different from that seen before: the price asked to first- 
time buyers is low but the repeater's price is high as observed in many instances of web 
sales. Profits are then m = \piiX - pi) and n 2 = p 2 {l - pi) + \p\. The only difference with 
the Naive-Commit case (up the change of optimizing variable p\ by p 2 ) is the \ factor 
in Ti\, the solution is the greater price p 2 = 1 > I which allows the firm to earn n = Jjj < | 
(the commitment level). Consumer surplus is W D = j^ = 0.14. The lack of commitment 
power thus arms everybody since both the firm and consumer lose; they are trapped into 
a web of mistrust about price changes that generates a . If historical discrimination is 
unavailable, then the firm applies the one-shot optimal monopoly price twice. 34 ® 

Recent advances in marketing techniques together with the advent of the computer 
age, mean that the commitment problem has become worse. Indeed, the finely tuned 
customer data that firms often possess allow them to build personalized prices. Such 
prices are often "secret" rather than public, and it is unlikely that firms can commit 
to such prices. Moreover, even if firms could commit, the complexity of the linkages 
between consumer actions and future prices may be too complicated for many consumers 
to comprehend. 
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Strategic Interaction 



Chapter 5 

Imperfect Competition 

Oligopoly with Simultaneous Decisions 



This chapter is the building block of the book. We concentrate on duopoly models i.e., 
the competition among two firms. Although this may seem a limitation when trying to 
study the abyss existing between perfect competition and monopoly, most results of this 
chapter extend straightforwardly to any finite number of firms. It is also frequent to 
find the perfect competition outcome when the number of firms grows large which gives 
in retrospect a justification for the price-taking assumption in the perfect competition 
model. 

The main models of oligopoly competition view firms competing by choosing prices 
and quantities (or production capacities). As we shall see, the Cournot (1838) approach 
is adequate for markets where prices can vary faster than production capacities (e.g., 
wheat, cement) whereas the Bertrand (1883) approach is adequate when firms can adjust 
capacities faster than prices (e.g., software). We also study a new form of competition 
based on contracts whereby firms offer multiple combinations of price and quantity and 
let clients pick their preferred choice; this is relevant for markets where the demand 
changes unexpectedly. 

Our first section complies with the basic requirement for the study of strategic be- 
havior, the introduction to game theory, the tool permitting the modeling of rational 
interaction. The following section focuses on the Cournot model of competition through 
quantities whereas the next one considers the more direct competition through prices 
together with some comparison and extensions. Lastly, we present succinctly under the 
heading of contest, some of the scenarios where firms compete outside the market. 



5.1 Competition via Quantities: Cournot 

The idea that firms compete via the quantities they wish to sell was first analyzed by 
Cournot (1838) who modelized the competition among two producers of mineral spring 
water of identical quality with a common nil marginal cost. In today's world, these firms 
could be Danone and Nestle with their respective brands of mineral still water Evian 
and Vittel. 

5.1.1 Symmetric Cost 

Let us present the basic model in the following manner: farmers 1 and 2 come to the 
market to sell their harvest (e.g., wheat); their (maximum) supplies are fixed quantities 
q\ and q 2 . This is the crucial hypothesis of Cournot's model; it can be explained by 
a technological limitation like a production capacity or the crop enabled by this year's 
weather. Assuming further that the two goods are homogeneous, a single price must 
hold in equilibrium. 1@ If the equilibrium price p is such that demand D{p) = a-bp is 
lesser than the maximum aggregate supply q\ + q 2 , then at least one farmer fails to sell 
all of his units. He can then undercut p to secure maximum sales; as shown in the price 
competition section, this behavior improves his profit. We therefore conclude that the 
equilibrium price satisfies D{p) > q\ + q 2 . In equilibrium of the market, demand equals 
supply thus we have D{p) = q x + q 2 ; hence the market price is given by the willingness to 
pay of the market through the relation 

a- <7i - q 2 . _ . , 

p = P(q l + q 2 ) = 2i— 2± (5.1) 

b 

Like a monopolist, farmer 1 understands that his decision q x directly impacts the mar- 
ket price; the novelty of the Cournot model is the presence of the decision q 2 taken by 
competing farmer 2 inside the price equation (5.1). 

Let us assume that both farmers share the same constant marginal cost of production 
c. Farmer 1, being rational, is able to anticipate that when farmer 2 brings in q 2 , its profit 
as a function of its own quantity q\ is 

n ( {{qi,q 2 ) = qiP{qi + q 2 )-cqi (5.2) 

His objective being to maximize profits, he will produce the quantity equating marginal 
revenue to marginal cost, here c. It is interesting to note that the marginal revenue 

R m ,i = P(qi + q2) + qiP'(qi + q2) (5.3) 



is greater than for a monopoly (obtained by merging farmers 1 and 2). Indeed, when 
farmer 1 wants to expand production, he must give a rebate to his current clients, but 
these do not constitute the entirety of the demand, hence the cost of the rebate is lesser 
than for a monopoly who has to give it to the whole demand (check the difference with 
eq. 3.3). 

Using our linear demand, we compute R m ,i - a ~ qz ~ qi , thus the best reply of farmer 1 
to q 2 is the quantity q\ that maximizes nfiqi, q 2 ) i.e., solves 



Rm,i - c 



q^BR^iqz) 



a -be- q2 



(5.4) 



I In the Cournot duopoly, each firm behaves like a monopolist, only that the demand he 
faces is the market demand minus the (anticipated) supply of his challenger. 

Symmetrically, 2 @ the best reply of farmer 2 to q\ is q 2 - BR^iqi) = a ~ °^ qx . Both curves 
are shown on Figure 5.1 and their intersection defines the unique Nash equilibrium since 
it the only situation where each action (quantity) is a best reply to the other. Recalling 
the definition of the Nash equilibrium, each farmer takes a decision contingent on what 
he believes his competitor will bring. The equilibrium occurs when both conjectures are 
actually correct. 




Figure 5.1: The Cournot Web 



To understand the process by which an equilibrium is reached imagine that on day 
farmer 1 brings a small quantity x°. Then the next day, farmer 2 having observed x°, 
choose as a best reply to bring a large quantity x l = BR^ix ). In turn farmer 1 will react 
on day 2 by bringing x 2 = BR^{x l ). It easy to observe that the quantities converge to the 
intersection of the two reaction curves. To derive formally this intersection, we solve 



simultaneously the two equations q\ = BR^{q 2 ) and q 2 - BR^iqi); substituting one into 
the other we obtain 

Qi = q 2 =q c = °—^- (5- 5 > 

It is clear that when farmer 1 anticipates that farmer 2 will bring q%, he finds it optimal 
to bring q^ and symmetrically, when farmer 2 anticipates that he will face q^, he is 
rationally lead to bring q£. Hence both actions reinforce the anticipation held by each 
farmer. The Nash equilibrium point is said to be self enforcing. 3@ 

Replacing the result from (5.5) into (5.1), we obtain the equilibrium price: 

r r r a + 2bc ,_ „. 

p C = P{q? + q^) = —^- (5.6) 

leading to total sales of 2q c . Since cost are symmetric, the profit margin is the same for 
both farmers; using (5.6) and symplifying, we get 

r a-bc q c 

»- c= ^t-t <5 - 7) 

The last relevant magnitude is the (common) equilibrium profit which is computed 
using (5.5) and (5.7) as 

*? = n c 2 = q c (p c -c)= l - (q c ) 2 = ^^ (5.8) 

The Lerner index of market power (cf. eq. (3.4)) is if = ^-^ = ^~ 2 b b c c > °- ^ e consumer 
surplus is Wg = ^ [2q c f = ^ {a-bc) 2 (cf. eq. (2.19)) and welfare (cf. definition in §2.3.2) 
is thus 

r r r r 4{a-bc) 2 „ {a-bc) 2 .„ „ x 

W c = w£ + tiC + n^ = — -<W* = - (5.9) 

D 1 2 9b lb 

the maximum welfare achieved at the Pareto optimum where willingness to pay is equated 

to marginal cost. 4@ The welfare loss is to due the application of market power by the 

duopolists. The ratio of inefficiency is W * VV Y - 11% which is less than the 25% loss occur- 

ing when there is a single firm, the monopoly (cf. eq. 3.5). 

The equilibrium of the Cournot duopoly involves an intermediate outcome in terms 
of efficiency. The equilibrium price, aggregate sales, individual profits and market 
welfare are all found between their respective monopoly and efficient levels. 

In the following sections, we use the results of the basic Cournot model to address a 
variety of questions. 



5.1.2 Asymmetric cost 

Theory 

There many reasons for the marginal cost of the two firms to differ. In the case of farmers, 
the fertility of their plots of land is likely to be different. More generally one firm can 
have a better technology. 

When farmer 1 has a cost advantage with c\ < C2, 

the best reply (5.4) becomes q\ - c ^~ q2 and symmetrically q 2 = c 2 2 ~ q] ■ Solving this 
linear system, the equilibrium is 

q C = a-Hlc.-c,) and q C = a-b&K-cy) (510) 

The equilibrium price is p c = a+b f b +c ^ and the total quantity sold is Q = 2fl ~^ 1+C2) . The 
more competitive firm ends-up selling more than its competitor, the difference being 
q^- q£ = b{c 2 - C\). The profits of firms remain given by the same formula as before with 
n i~\ i^i) an d n i = \ i^i) > they are ranked like sales. 



Cost Edge 

To enable a comparison with the previous symmetric situation, let us assume that firm 
2's marginal cost is c 2 = c while firm l's is c\ = c-y where 7 is a cost advantage. Applying 
(5.10), we obtain q^ = c 3 +2 r and q% - a ~ 3" y thus firm 1 ends-up selling q^ - q% = -^ 
more units thanks to her cost advantage; she also earns a premium of n^ - n^ = |(gf + 

<? 2 c )r- 

I A marginal cost advantage traduces, within the Cournot model of competition, into 
additional sales and profit that are both proportional to the edge. 

Trade Policy 

Even when the world best's technology is available to all, local regulations or taxes can 
distort market outcomes. An interesting application of competition with asymmetric cost 
is trade policy whereby a government impinges on the competition between a national 
firm (#1) and a foreign firm (#2). For instance, it can impose a duty t upon every unit 
imported in the country. If the two firms have initially the same marginal cost c and 
sell the common output q = ^f- at price p = a+ ^ c , then the trade policy effect is to make 
C\- c and c 2 = c+t. We can immediately look at the equilibrium for asymmetric costs to 
derive conclusions. We observe that q^ = q + | while q% = <? - y , hence the national firm 
increases its sales at the expense of the foreign one (who sells less than before). Although 



this policy protects the local firm, it hurts local consumers since the equilibrium price, 
p c = p + 1, increases with the duty t and the total quantity sold, Q c = Q- f decreases with 
it. 

Switching cost 

Many of the products we consume are durable and generate a long lasting utility; being 
accustomed to them, we find it difficult to switch to an alternative even if it is function- 
ally identical. 5@ The following list reveals the extend of the issue. 

• Need for compatibility with existing equipment 

• Transaction costs of switching suppliers 

• Costs of learning to use new brands 

• Uncertainty about the quality of untested brands 

• Discount coupons and loyalty programs (aka volume discount) 

• Brand loyalty and/or psychological inertia 6 ® 

As reported in Farrell and Klemperer (2007), empirical evidence of switching costs 
has been found for credit cards, cigarettes, software, supermarkets, air travel, frequent- 
flyer programs, telephone, TV channels, online broker, electricity supplier, bookstores 
and automobile insurance. 

As we shall see with a straightforward application of the Cournot model, switching 
costs have a notable impact on competition and are sought after by firms. Assume that 
for all consumers, there is a switching cost s€ ; the willingness to pay for a new product 
P{q) is therefore diminished by this amount which appears as an additional marginal 
cost for the firm in its profit function. If only one firm, say an entrant (#2), is affected 
by this phenomenon, its full marginal cost is c 2 = c + s while the incumbent keeps its 
marginal cost at c\ - c. All of our conclusions on the distortionary effect of a tax can be 
accommodated to the present setting. 

5.1.3 Oligopoly 

Symmetric Cost 

Extending the duopoly analysis to oligopoly (n active firms) is needed to address the 
issue of entry and understand why some markets have only 2 or 3 active firms while 
others are characterized by a two-digit figure. 

In the analysis of the duopoly, we used the Q = q\ + q 2 decomposition of the total 
supply. For the oligopoly, we select one firm i and write Q = qt + Q-t where Q-t = T^j^t Q] 



is the total amount brought to the market by i's competitors. The individual profit is 
Kiiqd = tfiPiQ) - CtiqO and the best reply solves the FOC 

P{Q)-C m>i {q i ) = -q i P\Q) o Se t = - (5.11) 

e 

where St designates the firm's market share. In our linear example, (5.11) simplifies into 
the previously seen FOC (5.4), so that the best reply is simply q t = BR9{Q_ t ) i.e., when 
goods are homogeneous it does not matter who produces the remaining units that the 
market will absorb. Repeating this operation for all n firms, we obtain n equations for 
the n unknown quantities yielding a finite number of solutions (very often a unique one). 
We now show for the linear demand D{p) = a-bp that if the production technologies 
are identical (i.e., same marginal cost Cj = c for all i < n), then a unique symmetric Nash 
equilibrium exist. In a symmetric equilibrium, the individual quantities selected by 
firms are all identical to some level q, so that the best reply equation (5.4) needs to be 
applied to qi = q and Q-t = {n- l)q; we thus obtain 

q = a ~ bc -^ => q = q c ^^f- (5.12) 

which is smaller than its competitive equilibrium counterpart q* = ^—^ (case where all 
farmers are price-takers). The total sales simplify into Q c = -^ {a- be), the equilibrium 
price is p c = al ^" c , while the individual firm profit is n c = ¥,~+% , leading to industry 

„. nf tVip mnnnnnlv IpvpI 

(«+l) 2 



profits n c = nn c , a proportion 4n 2 of the monopoly level. Lastly, the Lerner index of 



„c 



market power (cf eq. (3.4)) is 5£ = &-^ = ^~^ c > which is decreasing with the number 
of active firms. 

When n becomes large, the competition in the Cournot set-up brings the price nearby 
efficiency since both p c and p* tends to zero (industry profit also shrink towards zero). 
Another nice property of the Cournot model is its neutrality with respect to market size: 
as can be checked from (5.12), if the demand doubles (multiply a and b by 2) then sales 
and profits also duplicate. We can now state: 

As the number of active firms increases in the Cournot oligopoly, total production 
increases in equilibrium, the equilibrium price decreases and welfare rises. If the 
equilibrium is symmetric, then individual sales and profits decrease. 

Asymmetric Cost 

We end with the derivation of the equilibrium in the presence of cost asymmetries which 
shall be useful for later chapters. In the linear example, the best-reply of firm i is 2qi = 



a-bci - Q-t for all i < n. Summing these equations we get 

2Q = 2Z i > l q i =Lj>i(fl- bet) -Li>iQ-i = n{a-bc)-{n-l)Q 

=> Q c = ^- 1 (a-bc) (5.13) 

where c = ^Ey>i c/ is the average industry cost. Lastly we use Qf. = Q c - g? inside the 
best reply equation to derive the individual quantity 



a-b{na-Tj^iCj) _ a- b((n + l)c,- - nc) 
n+l n + l 



qf= " " V "" 1 w^ = — "— ^—; (514) 



This step-by-step derivation of the equilibrium has also proven its uniqueness. 

Industry Shock or Taxation 

Nelson (1957) wonders what happens to the industry (and its individual members) if the 
government taxes the output, say, to address a negative externality such a pollution, or 
if a competitive input market suffers a negative shock such as higher wages or dearer 
energy. These shocks raise marginal cost and lead unambiguously each firm to sell less, 
whatever the market structure and the form of interaction. There is thus a price hike. 
Intuition would then suggest a reduction in the profit margin and total earnings. This 
is correct for the linear demand used in this book (check eq. (5.12)) but need not always 
be so. Indeed, since the shock applies to the entire industry, it may reduce the intensity 
of rivalry and may thus end up being beneficial. Obviously, the same paradox may apply 
in reverse to a positive shock i.e., a reduction of taxes, wages or energy prices may be 
detrimental to the industry. As shown by Seade (1985) (cf. §7.2.3), when an oligopoly 
serves an inelastic demand, a negative outside shock is always profitable. An example 
was the gasoline market of western countries when the oil price rose in the 1970s. 

5.1.4 Welfare 

Market welfare sums the consumer and producers surpluses W = W D + W S . It is well 
known that a social planner wishing to maximize welfare would impose the competitive 
price p* . Interestingly, Bergstrom and Varian (1985) characterize the Cournot outcome 
in a similar way: a social planner would choose the Cournot price if his objective was 
to maximize W = W D + ^ry W~ s i.e., if he would favor firms over consumers (twice in the 
duopoly case). 

To prove this claim recall first that WsiQ) = E/>i tti = QP{Q) - Lj>i Q (<//) so that dW$ = 



QP'{Q)dQ + Zi>i [P(Q) ~ C m ,i [qi])dqi while we saw in (2.19) that dW D = -QP'(Q)dQ, thus 

[n-l)dW = {n-l)dW D + ndWs = QP'{Q)dQ + nL(P{Q)-C mi i(q i ))dq i 

= I [qiP'{Q)dQ + {P(Q) - Cnjiqt)) ndq t ) = P'(Q)Y, qi{dQ - ndq t ) 

using FOC (5.11) for an individual firm p-C m j{qi) = -qtP'. In the symmetric Cournot 
game where the equilibrium is necessarily symmetric, we have qi = ^ so that dW = 
since ^ factorizes out and dQ = E/>i dqi by construction. 

This characterization echoes the regulatory capture (cf. §16.3 and §17.3.3) that is 
observed in most industries: the producer's lobby through its professional association 
succeeds to increase its weight in the governmental balance between demand and supply 
by arguing, for instance, that a strong industry can create jobs. This is indeed important 
but let us not forget that the consumer surplus for that market is a monetary measure; 
thus, the higher it is, the more consumers are able to spend in other productive sectors 
or simply save (to finance investment in the rest of the economy). Welfare is the right 
measure of efficiency because it treats all sectors of the economy equally for their ability 
to generate wealth to the benefit of all (cf. §2.3.3). 

5.2 Competition via Prices: Bertrand 

Actual examples of markets for homogeneous goods where firms compete in prices are 
the online markets for software, music, movies, computers, electronic appliances, or 
books, to name a few. 7@ The conditions for perfect competition are satisfied thanks to 
the multitude of price comparison engines and shopping bots that enable potential buy- 
ers to compare most prices within minutes in order to buy from the cheapest seller. 

The same idea is turned upside down by firms who are large consumers of standard- 
ized products such as paper, toner or pens. They set up a periodic procurement auction 
for the specific items they desire such as "A4 paper of 80g/m 2 in 500 sheets packaging", 
how many units they need and the maximum price they are ready to pay. When the auc- 
tion starts, potential providers offer lower and lower prices until one remains the most 
inexpensive and win the right to provide the batch for the proposed price (cf. §22.1.1 for 
a detailed analysis of auctions). 

As intuition suggests, these markets are extremely competitive because firms are 
forced to name low prices to keep their customer base; the outcome is then propitious to 
consumers but unfavorable for firms who end up earning meager profits. In the following 
section, we study from a theoretical point of view, how this competition develops, who 



wins it and how losers might do something to avoid their fate. 

5.2.1 Pure Competition: Homogeneous Goods 

Bertrand Paradox 

The quantity based approach previously used to study oligopoly does not apply when 
price is the main variable by which firms compete to attract consumers. If two shops 
located side by side sell the same good or service (e.g., two bars with the same design 
and range of drinks), then consumers will obviously patronize the one displaying the 
lowest price. Hence, each firm is lead to undercut her competitor in order to serve the 
whole market. This process stops only when one of the firms starts losing money (in the 
economic sense). If the marginal production costs are c\ and a larger c 2 , the equilibrium 
is p 2 = c 2 and p\ = c 2 - e where e is the smallest monetary denomination. Indeed firm 2 
would make strictly negative profits if it were to match p x or even propose less. Likewise, 
firm 1 who is serving the whole demand and earning c 2 -c\-e per unit sold would lose 
half of its customers with p\ = c 2 and all of them with a larger price, hence it is better off 
this way. 

An example of such a "cut-throat" behavior is empirically validated by Koerner (2002) 
in the German market for coffee even though this market is highly concentrated (six 
firms account for 90% of total sales). To find other examples, it is enough to search google 
news for the expression "price war" to observe its constant appearance in many different 
sectors of the economy. This result was called a paradox by Bertrand (1883) because 
although there are only two firms which are not price-takers, they end up proposing 
a price nearby the competitive price (exactly when c\ = c 2 ). Chicago economists in the 
1960s turned the Bertrand Paradox into the slogan that "two is enough for competitive 
outcomes". 

Edgeworth Critique 

Edgeworth (1925) is neither satisfied by Cournot's model nor Bertrand's critic; he ob- 
serves that constant marginal cost is only an approximation and is only valid for a small 
range of production. Taking these elements into consideration, it is easy to see that the 
undercutting reasoning does not hold anymore. Consider for instance the marginal cost 
curve displayed on Figure 5.2. This technology enables production at constant cost c up 
to the capacity A; of the plant; for larger quantities, the unit cost increases by an amount 
8 that represent extra-hour wages or a more expensive energy. 

The critique reconsiders the previous analysis as follows: starting from a common 
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Figure 5.2: Increasing Marginal Cost 



price p where firms share the market evenly (50% each), firm 1 may consider undercut- 
ting its rival with p x < p. Initially 100% of the consumers will rush to firm 1 but because 
its sales will double, it will refuse a fraction 100 - k of the new customers who will then 
turn back to firm 2. The latter being already the most expensive can then raise its price 
to compensate for the loss of some of her customers. This process of competition is quite 
difficult to study but it is clear that firms do make positive profits since their marginal 
cost is increasing. Indeed, the price equates the cost of last and most costly unit so that 
all infra-marginal units either make a profit or avoid at least losses. 

To illustrate this critique, we can go back to Cournot's original example of competition 
between to mineral spring waters like Danone's Evian and Nestle's Vittel. The flow of a 
spring being limited by geological conditions, the production of bottled water is likewise 
limited and this rationalizes the Cournot approach to competition. Nowadays, major 
players in the beverage sector like Coca-Cola, PepsiCo and Nestle sell a purified and 
re-mineralized tap water under the names of "BonAqua", "Aquafina" and "Pure Life". 
Competition is then no more limited by production capacities and the Bertrand model 
applies. To address quality concerns in Europe regarding bottled tap water, these major 
players now sell spring waters of similar taste and composition under a common name 
such as Nestle's Acquarel, Danone's Dannon (US market) or FontVella (Spain). 

Kreps and Scheinkman (1983) and later Boccard and Wauthy (2000) show that if 
firms choose production capacities (ki,k 2 ) and later compete in prices (pi,p2) then, in 
equilibrium, there is no excess capacity (demand equals the sum of capacitites) and the 
price lies between the Cournot price p c and the competitive one depending on the level 
of the extra cost 6. More precisely, p = min{c + 5,p c } and capacities are previously chosen 
so that total supply q\ + q 2 equates demand for that price. 8@ Hence we may conclude that 



Competition via quantities (a la Cournot) is a synthesis of a more complex dynamic 
competition involving the choice of technologies and a competition via prices (a la 
Bertrand). If building capacity is costly, then Cournot (quantity) is the right model, 
otherwise Bertrand is a better description of duopoly competition. 

Empirical Validation 

The above prediction is validated by recent empirical research. Goolsbee and Cheva- 
lier (2002) study the competition between online booksellers Amazon and Barnes&Noble 
(BN); they show that a 1% price increase at Amazon reduces quantity by about 0.5% at 
Amazon but raises quantity at BN by 3.5%. Given that Amazon sells much more books, it 
is as if every customer lost by Amazon was buying from BN. On the other hand, raising 
prices by 1% at BN reduces sales about 4% but increases sales at Amazon by only 0.2% 
i.e., the lost customers from BN do not switch to Amazon, most of them are lost alto- 
gether. Studying online retail competition for books, cameras and printers among firms 
with groups of loyal consumers of varying size, Kocas (2005) show that cheap retailers 
compete with more randomized prices to capture switchers (non loyal consumers) while 
dear retailers prefer charging the higher reservation price with less randomization to 
maximize their profits from their loyal segments (the size of loyal client base plays the 
role of capacity). 

Another illustration is based on the market for Video Game Consoles in the US 9@ 
shows that both phenomena can take place sequentially. The three contenders are Sony 
with the Playstation2 launched in October 2000, Nintendo with the GameCube launched 
the 15th of November 2001 and Microsoft with the X-Box launched 3 days later. 10@ As 
one can observe from Table 5.1, the high Christmas prices the first year are explained by 
the bundling of consoles with games and also by the shortage of supply (which may have 
been voluntary). Shortly after Christmas 2001, both Nintendo and Microsoft reduces 
their prices and it took 5 months for Sony to match the lowest price. This immediately 
triggered a further price reduction by Microsoft in order for its product to remain the 
cheapest of the market. But just one week later, Nintendo decided to cut another 50$ 
from its price to recover its position of "best bargain". Undercutting has been going on 
until the products were gradually replaced by the next generation of game consoles. Total 
sales were respectively 35, 14 and 11 million systems for Sony, Microsoft and Nintendo. 
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Table 5.1: Evolution of Prices for Video Game Consoles 

The battle for e-book readers displays a similar trend. Sony introduced a crude Japan- 
only model in 2004 at 368$, then PDF e-readers in 2006 at 349$, in 2007 at 299$, in 2008 
at 399$ (later reduced to 359$), in 2009 at 299$. The market took off in November 2007 
with Amazon's 399$ Kindle. A new model was introduced in February 2009 at 359$, a 
modest 10% discount probably much less than the cost savings allowed by 15 months of 
experience (a long time in the electronic industries). Amazon then lowered the price to 
299$ in July and further down to 259$ in October anticipating the release of the competing 
Nook the following month at the same price. When the Nook price was cut to 199$ in June 
2010, Amazon did not wait a day to undercut its competitor with a price 189$. In July, 
the third version of the Kindle (3G) appeared at the same price. Sony lowered its price 
to 169$ in July 2010. 

5.2.2 The Hotelling model of Differentiation 

Differentiation in all its dimensions (horizontal, vertical) is studied thoroughly in chap- 
ter 11. We content here to introduce Hotelling (1929)'s simple elucidation of the Bertrand 
Paradox. 



Setting 

Upon observing that many people buy from many different shops charging different 
prices, this author reflects that if a seller starts to increase his price, he will start to 
lose some customers, but not all of them as perfect competition would predict. Secondly, 
the perfect competition framework does not say anything about where would go the cus- 
tomers lost by the seller; shall they refrain from consuming or buy from a competitor ? If 
product are perfectly independent, there won't be any spillover in the sense that the com- 
petitor will receive no additional clients while if products are perfect substitutes, as in 
Bertrand competition, there will be a maximal spillover since the competitor will recover 
all the clients. The issue is thus to determine the own price and cross price elasticities of 
demand. This is precisely what Hotelling (1929)'s construction is about. 

He considers a city with a single boulevard; two movie theaters are located at the 
outskirts, theater A standing on the west side and theater B on the east side; the distance 
between the two theaters is the unit of distance. Both show the same selection of movies 



at respective prices pa and ps (all prices are labeled in the same currency). Moviegoers 
are ready to pay p to watch a movie but suffer from walking to any of the theaters and 
value that loss at t per unit of distance. The alternative situation that Hotelling (1929) 
had in mind was of a transcontinental train line linking two harbors located on the east 
and west coast of the US; firms located inland can buy foreign merchandise in any harbor 
but must pay for the transportation cost back to their hometown. 

Model 

The length of the city boulevard is set to unity. A moviegoer living at point x e [0;1] 
obtains a surplus or utility ua(x) = p-pA-tx from buying at theater A, a utility Ub (x) = 
p-pB-t{\- x) from buying at theater B and a zero level of satisfaction (u = 0) if staying 
at home watching TV (alone). Figure 5.3 shows in plain lines the utility levels u A and ub 
as a function of location x (beware of the differing vertical axes). At a given location x, 
three levels are compared and the highest one indicates the optimal choice. The prices 
giving rise to the left panel of Figure 5.3 are sufficiently low in order that everybody goes 
to the movies. 11@ The upper envelope of the three curves ua,ub,uo is the bold dashed 
kinked line on the left panel of Figure 5.3. Notice that consumer x for whom ua = ub 
is indifferent about which theater to patronize; the precise address of this indifferent 
consumer is 
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Figure 5.3: Price Competition a la Hotelling 

One easily guesses that movie lovers leaving to the left of x go to theater A while 
those on the right go to theater B. To ease the derivation of demands, we assume a 
uniform distribution of consumers along the segment [0;1]; this way a location x is also 
the proportion of consumers living to the left of x. The demands are thus Da - x and 



Db = 1 - x. Assuming a marginal cost c for selling a ticket, the profit of theater A is 

7T A (p A ,p B ) = {pA-c)x = {p A -c) 1- — - (5.16) 

and a symmetric formula for theater B. The best reply of theater A against her competi- 
tor's price pb maximizes n A i.e., solves 

dn A t + c + pB , „ 

- — = o p A = (5.17) 

dp A 2 

The best reply of theater B against the price p A quoted by his opponent is entirely 
symmetric. 12@ Both are displayed on the right panel of Figure 5.3. Notice that contrary 
to the Cournot case, the best reply functions are increasing which leads to say that prices 
are strategic complements i.e., the higher my opponent's price, the higher I shall set 
mine. 

A Nash equilibrium of the pricing game is then a pair of prices that are best replies to 
each other i.e., at the intersection of the best reply curves on the right panel of Figure 5.3. 
Because of the symmetry of the best reply functions, the equilibrium is also symmetric 
i.e., both theaters name the same price. To find this equilibrium price, it is enough to 
solve (5.17) for p A = p B which yields p h = t+c. If the transportation cost was nil, we would 
fall back on the Bertrand outcome where both theaters price at their marginal cost. 13@ 
To conclude, 

The existence of the transportation cost for consumers enables competing firms to 
escape from the price war leading to the Bertrand paradox. They secure profits with 
clients living nearby; thanks to the heterogeneity of the market, each firm is able to 
behave as a local monopoly. 



5.2.3 Duality of Cournot and Bertrand 

Singh and Vives (1984)'s duopoly model generalizes the Hotelling model of differentia- 
tion to enable a comparison of the Bertrand and Cournot framework of competition. 

Price Competition 

We now proceed to present the model and derive these results. Shubik and Levitan 
(1980) show how to build a simple model of consumption (cf §2.2.1) in which the demand 
addressed to firm i = 1,2 is 

qt = D{pi,pj) = a-bpi + d{pj-pi) (5.18) 



where the coefficients are the market size a > 0, the price elasticity b > and the degree 
of differentiation or substitutability d. The two goods are independent for consumers if 
d = 0, substitutes if d > and complements if d < 0. 

Let us concentrate on the most frequent case of substitutability (d > 0) and on zero 
marginal cost to keep formulas simple. 14@ When firms compete in prices, each maximizes 
PiD{pi,pj). Solving the FOC, we obtain the best reply 

„ a + dvj 

Pi = BRf{pi) = ^ (5.19) 

F l yj 2(b + d) 



for i = 1,2; it is displayed by the plain lines on Figure 5.4. The symmetric Bertrand 

a 

2b+d 



equilibrium solving (5.19) is p B = jj^ and using (5.18) we obtain q B = {b + d)p B . 



Quantity Competition 

Let us now study the quantity competition i.e., the case where firms choose quantities 
and prices are determined by the market clearing conditions. The first step is to invert 
system (5.18) in order to express prices as functions of the choice variables. We transform 
the first equation into {b+d)pi = a-qi+dp2 and the second into p2 - °~ q b 2 + d Pl ', plugging the 
latter into the former, we obtain {b + d)p\ = a-q\ + -^ [a- q 2 + dpi) thus p\ {{b + d) 2 - d 2 ) = 
{b + d)(a- q x ) + d{a - q 2 ) and finally Pl = ^ + 2d ] -ib+d)q l -dq 2 _ Letting ^ = ^d__ >5 = ^A^ 

and a = a{/3 + 6), we can write for i = 1,2 

Pi = P{qt, qj) = a- fiq, - 8qj (5.20) 

Now that firms compete in quantities, each maximizes qiP(qi,qj). Solving the FOC, 

we obtain the best reply 

a-Sqj 
qt = — (5.21) 

for i = 1,2. The symmetric Cournot equilibrium found using (5.20) and (5.21) is q c = j^§ 
and p c = /3q c . 

Merging the two approaches 

It is not possible yet to mix graphically the two regimes of competition in order to com- 
pare them because the Cournot best replies are quantities relationships. However, we 
can use the demand equation (5.18) to transform the quantity best reply (5.21) into a 
price relationship; we have 

2)6 [a- [b + d)pi + dpj] = a- 6 [a- {b + d)pj + dpt) 



> Pi = BRf( P j)= — J >BR?( Pj ) 



2(b + d)< 



(5.22) 



since a + dpj > (cf. (5.19)) and 2[ ^^_ d2 > ^d) ° 2 ^ + d "> 2 > 2 ^ + d ) 2 ~ rf2 is true - The 
curve corresponding to (5.22) is drawn, together with its symmetric, as a dotted line on 
Figure 5.4. 
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Figure 5.4: Bertrand vs. Cournot 

We immediately observe that the (symmetric) Bertrand equilibrium (where best replies 
cross) involves a lower price than the (symmetric) Cournot equilibrium. To derive a simi- 
lar ordering of quantities, we simply use the equilibrium relation q = a-(b+d)p to deduce 
q c <q B . We can now conclude that 

In a model of duopoly competition for differentiated products, consumer surplus and 
welfare are greater under Bertrand competition while profits are greater under Cour- 
not competition. 

As the substitutability factor d goes to zero, the difference in (5.22) vanishes (and 
both prices converge to the monopoly price since the two goods become independents). 
When goods are complements (d < 0), the best reply functions are decreasing (check in 
(5.19) and (5.21)) but their ordering remains the same so that equilibria are ranked as 
before. We can therefore say that 

I The Bertrand framework is always more competitive than the Cournot one, indepen- 
dently of whether the differentiated goods are substitutes or complements. 

The intuition underlying this important result is simply that firms perceive a higher 
elasticity of demand (in absolute value) under Bertrand than under Cournot competition; 



indeed in the first case it is [b + d)%- while in the second case the inverse elasticity is B I 1 

HI Pi 

leading to a smaller price-elasticity of i^- (independently of the sign of d). 15@ 

Choosing the mode of competition f 

The second result of Singh and Vives (1984) is that firms prefer to behave in a Cournot 
fashion (fix quantities) over a Bertrand fashion (fix prices) if they are free to choose. This 
is shown in a limited version of Grossman (1981)'s contract competition (cf §6.1.7) where 
firms can choose one but only one of the following contract with their clients: guarantee 
a price or a quantity 16 ® This leads to a simple 2x2 matrix game illustrated on the right 
pane of Figure 5.4 for which we already know the payoffs when the choices are identical 
i.e., [B,B) or {C,Q. Indeed the equilibria are shown on Figure 5.4 at the intersection of 
best reply curves and we know that firms prefer high equilibrium prices. We only have 
to study the case of heterogeneous choices to be able to conclude. 

In case of heterogeneous choices, the price player, e.g. #1, sets his price p\ to maxi- 
mizes piq\ where pi = a-Bqi -8q 2 due to the commitment of firm 2 over q 2 ; hence firm 
1 maximizes \p\{a -6q 2 - p\) and sets p\ - a ~ 2 q2 . Likewise, the quantity player (#2) 
sets his quantity q 2 to maximize q 2 p 2 where q 2 = a- [b + d)p 2 + dpi, hence maximizes 
j^q 2 [q 2 -a- dpi) and sets q 2 = a+ 2 Pl . Once again these best reply cannot be drawn di- 
rectly in the prices space, thus we use equations (5.18) and (5.20) to observe that the 
price player (#1) plays the Cournot best reply (expressed in prices) pi = BR^{p 2 ), while 
the quantity player (#2) plays the Bertrand best reply p 2 = BR^ipi). The ensuing equilib- 
rium is easily identified on Figure 5.4 as the diamond while the equilibrium where the 
firms adopt the reversed roles is the large dot. The proof is now trivial: independently of 
firm l's commitment, firm 2 benefits from switching its own commitment towards quan- 
tity (C) since the final price increases from B to ♦ against a price player and from • to 
C against a quantity player. This preference is shown by the small arrows in the matrix 
game of Figure 5.4. The conclusion we can draw from this analysis is that 

I Firms are likely to favor investments or technology choices that limit or slow down 
their price reactiveness i.e., turn them into less aggressive competitors. 



5.3 Contract Competition 

When firms compete for clients by offering multi dimensional contracts involving price, 
quantities and quality features, the distribution of consumers among firms obeys the 
maximum expected utility principle. Taking into account the intrinsic differences be- 
tween rival suppliers (cf. differentiation), consumer flock towards that offering the great- 



est level of satisfaction. This process is a generalization of Bertrand competition for an 
homogeneous product. In this section, we illustrate this process with several examples. 

5.3.1 Supply Function 

We first inquire a form of competition lying between the Bertrand and Cournot bench- 
marks. Recall indeed that in the former, a firm proposes a price and agrees to supply any 
quantity whereas in the latter, a firm commits a quantity but agrees to whatever price 
the market dictates. Many electronic markets are organized in a less extreme manner, 
as firms are required to propose quantity-price contingent plans i.e., blocks of units at 
specific prices. 

A Contractual View 

Grossman (1981) shows how Cournot and Bertrand competition can be related within a 
common framework putting contracts at the center stage and how the whole competition 
process transmits to the entry decision. 

Under Cournot competition, the monopoly quantity may be large enough to make the 
price lower than the entrant's average cost. In that case, the entrant stays out and the 
incumbent enjoys monopoly rents. In this story, when the incumbent announces he will 
supply the monopoly quantity, the potential entrant believes that there is no way to steal 
his customers. How can she hold such a belief? After all, the challenger could advertise a 
contract guaranteeing a lower price to customers accepting to switch provider; this way 
it would be possible to seize a portion of the incumbent's market share. Yet, the latter 
will himself react to this threat, for instance, by signing an exclusive dealing agreement 
with his customers (an example of which is studied in §9.2.1). In that case, the entrant 
is right to believe she cannot expand her market share. 

Notwithstanding, if such binding exclusivity agreements are impossible to write or 
too costly to enforce or are simply illegal, then the entrant can effectively get the whole 
market with her undercutting price. We are therefore back into the Bertrand story and 
the ensuing price war. Monopoly is not anymore an equilibrium outcome since the en- 
trant will step into the market. To sum-up, Cournot appears to be the right competition 
framework if binding contracts with customers are available, otherwise Bertrand is the 
right approach. 

Credible Contracts under Certainty 

Grossman (1981) then adds Edgeworth's original critique of Bertrand competition: who- 



ever pretends to sell a large quantity at a price lower than her own average cost goes 
bankrupt; in other words, a seller will ration potential clients to sell only profitable 
units. 17 ® The consumers may therefore anticipate that the total quantity supplied by a 
firm will, in fact, be a function of the lowest price in the market. A typical contract will 
be an exclusive dealership for a quantity q at a given unit price p which does not leads to 
bankruptcy i.e., such that p > AC{q) and the guarantee of meeting any lower price that 
would be posted in the market. A general contract is thus a supply function q{.) linking 
prices to quantities and satisfying the no-bankruptcy condition. The Cournot conjecture 
is the extreme case where the supply function is flat (q{p) = q,Vp) while in the Bertrand 
case it is vertical (p{q) = p,\lq). 

Within this framework, Grossman (1981) shows that the outcome of perfect compe- 
tition is also an outcome of the supply curve competition and conversely, that a supply 
curve equilibrium yields a perfect competition outcome. Surely, this is an elegant re- 
sult but of scant use to draw policy implications because too many things can happen; 
indeed, the game at stake between the firms has too many equilibria. A refinement or 
more ambitious theory is needed to disentangle the issue. 

Uncertainty 

The reader certainly recalls that the supply of a perfectly competitive firm is a (supply) 
function while firms with market power propose a quantity-price pair (in fact either 
of them since the other one is determined by the residual demand of the firm). The 
reason for this discrepancy is well known but worth restating here: the competitive firm 
ignores what the equilibrium price will be, thus she prepares herself for any contingency, 
for any possibility that might arise and offers one quantity for every single price that 
may become the equilibrium price. The firm with market power, on the other hand, 
anticipates the market demand and the supply of her competitors (if any); she is thus 
able to estimate with exactness her residual demand, so that she only has to pick the 
optimal price or quantity that maximizes her profit over that demand. 

The observation made by Klemperer and Mayer (1989) is that many market situa- 
tions involve a degree of uncertainty regarding demand, at least at the time where the 
firm's strategy must be committed. One fundamental reason, studied in Part F, is the 
internal organization of firms: adjustment cost, communication failures or managerial 
incentives combine to force top management to set fixed rules of marketing for the lower 
levels of management; these decisions implicitly determine a supply function. Examples 
are services such as consulting, law, software programming or any other qualified activ- 
ity sold by specialized firms; they often quote a per hour price but the real allocation of 
people on tasks by the general manager depens on the global amount of business. If there 



is some slack then more hours are worked than billed but if business is tight, all extras 
such as travel time or intial negotiations are added to the bill. All in all, the greater the 
contracted time, the greater the average price, that is to say, the policy designed by the 
general manager gives rise to a supply function. 

Another domain where supply functions naturally appear are the markets for ser- 
vices with unexpected variations of demand such as electricity, 18 ® hotels or airplanes. 
Producers cannot bid ever changing quantity-price pairs that adjust in real time to the 
changes in demand; rather they must submit a series of strategies that cover all relevant 
possibilities. In the case of electricity, a firm bids a series of pair (qi,pi), {qz,pz),{q-i,pz),... 
where the first q\ units are offered at the unit price p\, the next q 2 units are offered at 
the (greater) unit price p 2 , the next q 3 units are offered at the (still greater) unit price p 3 
and so on (these quantities often match the capacities of plants). The average price for a 
production of units q < q x is p u while it is p^+u?-^ for q x < q < q 2 , p^+^+^-^ps 

q q 

for q 2 < q < q-i (and so on). The relation between average price and quantity is a supply 
function s{p) stating how much the firm is willing to supply for the unit price p. 

Klemperer and Mayer (1989) prove that the supply function equilibrium is unique 
and lies between the Cournot and Bertrand outcomes. 

Demand uncertainty reduces the market power of active firms; they are forced to pro- 
pose supply functions rather than fixed quantities; as a result, the equilibrium is more 
competitive. 

Formal Analysis f 

To study formally the competition between two firms in the presence of uncertainty, it 
is enough to assume that the market size parameter a in the demand D{p) = a-bp, 
is unknown to the firms at the time where they set their pricing strategy. The com- 
mon technology of firms displays DRS i.e., the cost function is increasing convex C{q) = 
F + 6q 2 /2. 19@ Firms choose their supply functions s ; and Sj before learning the shock pa- 
rameter a. Since both supply functions are increasing, their sum is also increasing while 
the demand is decreasing; hence there is a unique market price p equating demand and 
supply i.e., solving a-bp = Siip) + Sj{p). 20@ 

Given the strategy Sj of the opponent, the residual demand for firm i is a-bp- Sj{p) 
so that her profit is ntip) = p{a - bp - s j{p)) - C [a - bp - s j(p)) and the FOC characterizing 
the optimal price is then 

= a-bp- Sj(p) - [p-8[a-bp- Sj{p))) [b + s'A 



a-bp- Sj{p) 

=> s jW + b = xr \ < T\ (5 - 23) 

J p-o[a-bp- Sj{p)j 

Equation (5.23) determines an optimal price p l a that depends on the current shock a 

together with a quantity q' a = a- bp l a - Sj{p l a ). As the parameter a varies, the pair [p l a , q l a ) 

describes a curve in the price-quantity space which is firm z"s optimal supply function 

s*{p); it catches the idea that for every possible macro-economic contingency, there is an 

optimal response to the competitor's supply function. Notice that equation (5.23) can be 

written 

s'(p) + b= $i [ P \ - (5.24) 

J p-5si{p) 

The same analysis for the other firm yields an optimal supply function s*Ap). Let us 
look for a linear symmetric supply function (SF) equilibrium i.e., where S{[p) = Sj(p) = 
a + ftp thus (5.24) becomes 

p + b= a + P p o[{p + b){l-6p)-l3}p = a{6(P + b) + l) (5.25) 

p-oa-o(5p 

Since the polynomial in (5.25) must be nil for all p, it must be the nil polynomial i.e., 
all its coefficient must be zero. From the LHS, we derive 21 ® (5 - \ l-b+Vb 2 + 4b/5\ > 
while the RHS yields the necessary condition a = because 8{/3 + b) + 1 ^ 0. 

The slope of this equilibrium supply function is intermediate between the Cournot 
and Bertrand cases which are respectively flat and vertical. Finally, the equilibrium 
price solves a-bp = S{{p) + Sj(p) = 2a + 2/3p so that p SF = t^j- The Bertand equilibrium 
price cannot be determined here due to Edgeworth cycles; however when 5 goes to zero, 
marginal cost tend to zero, the SF equilibrium slope /3 tends to infinity driving the SF 
equilibrium price towards zero i.e., we are back to a Bertrand-like outcome. The Cournot 
price can be taken from equation (6.8), it is p c = l^'jfg] which is greater than the SF 
equilibrium price. 22@ 

5.3.2 Clubs 

The economic notion of a club good, as reviewed by Scotchmer (2002) usually refers to 
public goods and public services jointly consumed. 23 ® The Public Economics approach 
takes a cooperative and pro-active view and concentrates on the formation and stability 
of the club together with its financing under a series of geographical or jurisdictional 
constraints. 

From our perspective in this book, clubs relate to public utilities (cf §17.1.2) and pri- 
vate services building on a network of customers such as recreational or cultural clubs. 



From an analytical point of view, competing clubs offer contracts to their prospective 
members with a view to raise their utility and sway their endorsement. We are thus 
faced with a particular brand of oligopolistic contract competition. 

Tradition sees club goods as services naturally displaying a positive externality through 
network effects. Yet, they may also suffer a negative externality, congestion when mem- 
bership grows too much because at any point in time, a club has a limited capacity of 
(instantaneous) service (cf. §25). From the industrial organization viewpoint, a club is 
more than a local association (book, wine, chess or sport club), it includes any service 
delivered through a minimum size facility (e.g., movie theater) or infrastructure (e.g., 
highway). At the heart of any club formation is the recognition that members confer 
positive externalities onto each other but also that a minimum (critical) size must be 
reached for two reasons: to start-up the network effect and to finance of the fixed cost of 
club creation. By its very nature, a club can serve many users simultaneously at near 
zero marginal cost; the club service is then "non rival", which is why it resembles a public 
good (but different as the latter holds the further property of being "non-excludable"). 

From the point of view of a user, aggregate membership (of the club) is a quality 
attribute. At low levels, the relationship may be positive because we enjoy better service 
in community than alone, 24@ but for levels close to capacity, congestion kicks-in and 
aggregate use undoubtedly becomes a negative quality attribute. Yet, rather than being 
exogenously chosen by the service supplier (cf. §3.3), aggregate use is endogenously 
determined by the equilibrium behavior of potential users. The thrust of the problem lies 
over the congestion range because the manager, whether a welfare or profit maximizer, 
will always try to motivate use or membership close to capacity level since it raise utility 
(thus WTP) while cost is barely affected. 

To study this issue, Scotchmer (1985b) assumes that competition among clubs is fierce 
enough to maintain the overall price of the service below the WTP of consumers so that 
the market is covered in equilibrium. One can then inquire about how many firms will 
be active, how many clients will they have and how much will these consume. Modeling 
depends on whether the facility size is stretchable or equivalently whether investment is 
lumpy. The dichotomy could be between a golf club (fixed size) and a tennis club (variable 
size) or between a highway and an optic fiber network. In any case, we may define the 
willingness to pay for the service (WTP) and the willingness to avoid congestion (WTA) 
which are assumed decreasing and increasing, respectively. Hereafter, total WTP and 
WTA sum these monetary valuation over club members. 

We develop the variable capacity case. Utility u[k,n) depends negatively on member- 
ship n (crowding) and positively on the facility size k whose (per period) cost is C[k). We 
have WTP = Uk and WTA = -u n . Let us derive first the efficient organization. 



The welfare objective is to maximize the utility of a representative consumer net of 
his fair share of cost i.e., W = u{k,n) - ^jp. The club size FOC yields the Samuelson 

equation 25 ® 

dW 

= o C m = nu k o MC = TWTP (5.26) 

dk 

while the membership FOC yields 26 ® 

^ = o 9±® = _ nUn AC = TWTA (5.27) 

on n 

The solution of (5.26) is k* n ; plugging into (5.27) , we derive the efficient membership 
n*. To cover the entire unit mass population efficiently we thus need -^ clubs. 

Let us now consider m competing clubs independently choosing their size A; and mem- 
bership fee /. Given these, consumers allocate themselves among clubs, seeking the 
greatest utility. In equilibrium of this process, all utilities are equalized to u. 27@ If 
a club has n members, each derives utility u{k,ri) - f = u and the club's profit is then 
n= nf-C{k) = n{u{k,n)- u)-C(k). The capacity FOC is 

df 

C m = n— = nu k (5.28) 

dk 

since capacity A; has no direct bearing upon utility elsewhere it,. The optimal size, con- 
ditional on membership, is the efficient one k* n since (5.28) reduces to (5.26). This is an 
important property of contract competition: once buyers become customers, their rela- 
tionship with the firm is Pareto efficient because the fee can be used to share welfare 
so that other variables can be set so as to maximize welfare. 28 ® Lastly, the membership 
FOC is 

f=r£f- (5.29) 

dn 

If a club wishes to attract more clients, she is bound to increase local congestion thus 

she must reduce her fee to avoid loosing all of her clients. Furthermore, as competitors 

receive now less clients, congestion is reduced and satisfaction is increased at these clubs. 

This forces the deviator to further decrease her fee. This equilibrating process implicitly 

define the relationship between membership and fee. To uncover it, suppose all but one 

firm play a candidate equilibrium (/, k) while the remaining one adjusts / to receive n 

clients (having picked the size k* n in anticipation before). Other firms then receive j^j 

clients and the market clearing equation is u{k, j^zj)-f = u{k, n)-f. Differentiating wrt. 

membership base n yields 

-1 (t l-n\ , df df m 

-u n \k, \ = u n {k, ri)-— o — = u n (5.30) 



ffl-1 on on m-l 



at the symmetric equilibrium where all actions are identical. 

Under free entry, profits are gradually eroded and drive the fee tends toward the 
average cost so that using (5.30), (5.29) becomes 

= / = -nu n (5.31) 

n m-l 

The comparison with (5.27) is immediate: the equilibrium fee exceeds the congestion 
cost by the ratio ^j; the solution is thus lesser 29 @ than n* meaning that the optimal 
club size is smaller than required by efficiency. As an ultimate consequence, too many 
firms enter the market. 30 ® 

As in the standard Cournot competition, the market power afforded to firms by their 
peculiar mode of rivalry allows them to earn economic profit and thus attracts too many 
in the field. Note however that as the cost of operating a club falls (due to technologi- 
cal progress), the efficient club size falls i.e., the efficient number of clubs rises so that 
the ratio E] ^- converges to unity meaning that an almost correct number of firms will 
enter. The policy recommendation flowing from the model is to limit entry in facility 
based services (i.e., prone to congestion) only when the required investment for building 
a minimum size facility is so large that the efficient supply structure is close to a natural 
monopoly. 

In the golf club case (or amusement park), the service capacity is fixed but it makes 
sense to allow members to vary usage which can then be priced. The constrained Pareto 
optimality arising from contract competition allows firms to apply marginal cost pricing. 
In the end, competition builds on the membership fee only. The analysis then follows 
along the previous lines. Scotchmer (1985a) shows that the first-best and the free entry 
equilibrium equations are those characterized above. 

5.3.3 Platform Economics 

The novel "platform" terminology aims to convey the idea that competition for modern 
bundled goods differs sensibly from the standard modes of either price or quantity com- 
petition. 

Introduction 

As can be seen from the list of examples below, technological progress creates a new 
long-term relationship between a firm and its customers. What is at stake is a package 
made of a fixed asset, the platform, together a flow of services. 31 ® 

Game Console Device and game cartridges (Playstation, Gameboy, Xbox) 



Printer Device and ink cartridges (e.g. HP, Lexmark) 

Nespresso Coffee machine (build by several makers) and patented Nestle capsules 

iPod Portable device and songs bought from iTunes store 

iPad Portable device and digital contents bought from e-retailers 

Mobile Phone Device and usage plan 

TV Device and distribution plan for channels (cable, satellite, DVB) 

Health membership and plan with a health maintenance organization (HMO) 

Car Automobile and repair parts (mostly electronic) 

Software Initials release vs. upgrades and support services 

The key characteristic of a platform is that the asset and the services are comple- 
mentary in a way that technology makes unique and hard to emulate. This bestows the 
vendor of the platform an exclusivity for the sale of future services, at least until the cus- 
tomer terminates the entire relationship. Given this lock-in of customers, firms compete 
vigorously to sign them up with schemes colloquially referred to as "bargain-then-rip- 
off '. One also speak of "fore-market competition" followed by "aftermarket monopoliza- 
tion". Indeed, the asset is often sold at a price close or even below cost to lure clients 
but these eventually suffer high markups over services given that the "aftermarket" is 
monopolized. The phenomenon is so strong that it can even happen with a monopolist as 
shown in §4.3.2. Because firms compete for customer "before the market", the standard 
conclusions of price theory for competition "in the market" are altered. 

A crucial distinction for the analysis of platforms is whether the firm can commit or 
not to future prices for services. If so, the bilateral relationship between a "committed" 
firm and a client is Pareto efficient i.e., generates maximum welfare. Profits for the firm 
and utility for the client are then shared by sliding the initial asset price as a function 
of competitive pressure in the market and outside opportunities for the client. The most 
frequent case, however, is when firms are unable to commit to a future pricing behavior; 
they cannot help but monopolize the aftermarket with high service prices (cf §14.1.3). 
Since this behavior reduces the consumer surplus, the overall WTP for the platform falls 
and so do profits, independently of whether the firm is a monopoly or an oligopolist. This 
shows that, in a world devoid of risk, firms would like to commit over their future prices. 
If they so frequently pass on this opportunity, it must be due to uncertainties regarding 
the evolution of input prices, technological development or future demand. We analyze 
the two situations in turn before comparing them. 

Committed pricing A firm has unit cost y for the asset and c for services, profit is thus 
71 - if ~ Y + (P ~ c ) <?) D (/> P) where q is the service demand of a customer facing price p and 



D{f,p) the market share of the firm when she offers the bundle (/, p). 32@ If the firm can 
commit to future prices it is optimal to set p = c so that profit becomes n = [f -y)D{f,p) 
and the FOC is (/ - y) |§ + D = => ^J- - j-. We thus observe that when firms compete for 
customers through the access fee only, profit maximization leads to equate the Lerner 
Index to the inverse elasticity of demand (cf eq. (3.4)) with the important proviso that 
the traditional unit price is replaced by the access fee. 

As product differentiation lessens or as the number of firms increases, the elasticity 
of access demand facing any single firm is likely to increase, causing the equilibrium 
fixed fee and price-cost margin to fall. 

Uncommitted pricing If the firm is unable to commit to future prices, then after- 
market monopolization occurs in the sense that the firm cannot help but sell her flow 
services at the standard monopoly price i.e., "rip-off" her customers. The FOC is com- 
monly expressed with the Lerner index as ^-^ = j-. Consumption and consumer sur- 
plus are greatly reduced (wrt. the previous case). In anticipation, clients require a 
very low entry price or even a subsidy. Indeed, the FOC for setting the access fee is 
f-y+(p-c)q = ll ^f ^£jt = ±-?3-j- (using the other FOC). The Lerner index for the 
entry fee is a modified inverse elasticity because the impact of after-market monopoliza- 
tion must be subtracted. The margin may be negative meaning that the asset becomes a 
loss leader . 

Examples abound with mobile phone plans including a free high-tech phone bundled 
with a two-year permanence and minimum billing, Nespresso's affordable gorgeous ma- 
chines and expensive coffee capsules, HP's cartridge set which is more expensive than 
the printer or Brita's water jug cheaper than a pack of filtering cartridges. 

Comparison The absence of commitment generates a deadweight loss from the mono- 
poly pricing, but competition for customers puts strong downward pressure on access 
fees that conceivably could result in higher overall market penetration which is socially 
desirable. Yet, insofar as there are no serious income effect, the committed case yields 
more utility thus more participation and a larger welfare. It is also unequivocally yield 
more profit because the per-capita profit is larger. 

If the per-capita earning is the same then client's expenses are identical and the 
"committed" client must be happier as he consumes the efficient amount. 33@ We may 
then increase the "committed" firm's access fee until the client is equally happy under 
either scheme; thus the "committed" firm earns more per-capita. However, equal utility 
implies the same market share and the same sensitivity to the fee 34@ and equality of 
^fP- for both kind of firms. In order for the FOCs to be satisfied, it must the case that 



the "committed" fee is lower (since ^fP- is decreasing in fee). This means that the client 
under "commitment" must enjoy a higher utility in equilibrium. As a consequence, fewer 
people choose the outside option so that market coverage is greater. 

5.3.4 Competition and Discrimination 

In this section on oligopolistic contract competition, we have expanded the "weapons" 
from price or output to (potentially complex) contracts. Among those, we find elabo- 
rated tariff structures such as volume discount, subscriptions or versioning which are 
all instances of differential pricing. This topic is thoroughly in chapter 4 for the case of 
firm holding market power and facing an exogenous population of potential clients. In 
the present paragraph, we study the interaction between competition and complex (non 
linear) pricing pricing following Stole (2007) and Armstrong (2007) . 

Regarding welfare or the social desirability of price discrimination, we may look at 
the impact of entry against a discriminating monopoly or at the repercussion of allowing 
price discrimination in an oligopoly. 

Perfect Discrimination 

Spence (1976b) argues that when a seller can price discriminate perfectly, she captures 
her marginal contribution to consumer surplus so that her profit coincides with her mar- 
ginal contribution to welfare. As a producer, she is lead to choose her product variety, 
quality and quantity so as to maximize welfare. Entry decisions are then efficient since 
a firm enters the market if and only if her profit which is her marginal contribution to 
welfare exceeds the entry cost. This is entirely true for a monopoly but in oligopolistic 
situations care must be taken. 

Regarding pricing, one must understand that perfect discrimination refers in fact to 
the greater availability of contractual means for buyers and sellers to negotiate their 
deals (cf §4.1.3 with the many examples of consumer surplus extraction schemes). A 
result ubiquitous of many models is that once firms are able to offer rich contracts, these 
induce an efficient bilateral relationship with each client. In essence, bargaining over 
the terms of the contract yields an efficient result, the Pareto optimal barter of the Edge- 
worth box (cf. §2.4.3). 35@ Perfect price discrimination thus allows firms to capture their 
marginal contribution to welfare. The non cooperative choice of characteristics are then 
efficient since firms face the same incentives as a social planner. 



Entry 

As shown by Bhaskar and To (2004), there is too much entry although a potential en- 
trant's profit is his welfare contribution. The hidden distortion is that this welfare con- 
tribution is computed for an inefficient bunch of characteristics on the part of the in- 
cumbents, so that it is greater than if the optimal reconfiguration was used as we now 
proceed to show in a particular case. 

In the circular city location model of Salop (1979) (cf §11.1.3), perfect discrimination 
lead firms to compete in a Bertrand fashion over every single customer. As intuition 
would suggest, the firm closest to a consumer wins him: efficiency obtains. The equilib- 
rium price is equal to the best that the second-nearest could offer which ends up being 
the difference in transportation cost. Indeed, consider consumer located at x between 
firm A located at and firm B located at \, his utility is either ua(Pa) = v-tx-pA or 
ub(Pb) v-t(~-x)-pB- Now, when x < ^, the best that B can offer is ugiO) = v - t{^ - x) < 
ua^Pa) o Pa ^ t (^ - x) - tx which is the optimal price offered by A to win the customer 
over B. The producer surplus is then 2/ 1/2 " j^-2txdx= ^. Taking into account the fixed 

cost of entry F, profit is -^ -F and the free-entry number of firms is Jjp = V2n* , where 
n* is the efficient level characterized in §6.1.4. Since free entry without price discrimi- 
nation leads to even more entry {h = 2n*), we may say that price discrimination reduces 
excessive entry but comes short of bringing full efficiency. 

Imperfect Discrimination 

As shown in §4.2.2, if firm discriminate among segments but are bound to use uniform 
pricing within segments then each segment is a standard oligopolistic market where 
goods may be homogeneous or differentiated. 

If firms have the same technology and demand in segment #j is the homogeneous 
Qj = aj-bjPj, the oligopolistic equilibrium is Q* = -^(aj -bjd), thus total output is 
Qo - Lj Q*; - t^tt^o - b c) (with obvious notation). Observe then that in the absence of 
discrimination, total demand is Qo = ao - bopo and the equilibrium is Qq. Thus, discrim- 
ination does not change total output in an homogeneous market, only the identity of 
buyers which is inefficient since low prices in elastic segments mean that people will 
low WTP buy the good while people with higher WTP fail to do so if they belong to low 
elasticity segments (where the price is set higher). 

We may thus conclude that price discrimination tends to reduce welfare as it gener- 
ates misallocation among consumers. More generally, when demand curves are nonlin- 
ear or some markets would not be served under uniform pricing, price discrimination 
may increase welfare as it may trigger a greater covering of the market. 36@ 



The effect on output of allowing discrimination in a duopoly market is not clear. 
Holmes (1989) shows that if the ratio of cross price elasticity to monopolist's elasticity 
is greater in the strong segment then total output increases because output falls less in 
the strong segment than it rises in the weak one. If the reduction in the strong segment 
is sufficiently small relative to the weak one, then welfare will also rise. Even less can 
be said of how equilibrium profit in the industry changes with the introduction of price 
discrimination (cf sec 3.3 in Stole (2007)). 



Chapter 6 
Strategic Moves 

Oligopoly with Sequential Decisions 



The simultaneous competition featured in the previous chapter, either through prices, 
quantities or contracts is only the last stage of the global rivalry that takes place among 
oligopolists i.e., once firms have chosen their technology and entered the market. In this 
chapter, we broaden our viewpoint and analyze entry, production and sales as sequential 
decisions whose rationality turns them into strategic moves. 

Our first section is devoted to the analysis of entry under a variety of market struc- 
tures and conducts but with a strong symmetry hypothesis: all firms, incumbents and 
potential entrants alike, have access to the same technology. We derive the long term 
equilibrium in each case and assess its efficiency. 

We then consider the opportunity for a firm to make a commitment such as producing 
and dumping a given quantity on the market using the Stackelberg (1934)'s leadership 
model. The motive for such a decision was seen in the previous chapter: a cost advantage 
translates into a profit advantage; thus, each firm will try to invest into a better produc- 
tion system or a better distribution network to gain that edge over the rest of the herd. 
The next sections then analyze variations around this issue with forward contracting 
(i.e., separation of production and sales) and vertical relationships as a mean to commit 
to a specific behavior. 

6.1 Dynamic of Entry and Exit 

The reduction of concentration through the entry of new firms in markets is key to foster 
competition and innovation among all contenders. This whole process is desirable as it 
eventually advances economic growth and general welfare. We study in this section the 
conditions under which a potential challenger takes the irrevocable entry decision i.e., 
spending the sunk cost that enables him to become an effective competitor in the mar- 



ket. The study of this long term market dynamic under a variety of market structures 
and conducts will serve as a reference for §10.2 where we tackle the possibility that in- 
cumbent firms establish barriers to entry. Recall indeed that the less concentrated is 
the market, the more intense is the competition and the lesser are the profits which 
ultimately means that entry harms incumbents. 

The welfare concept we have adopted in equation (2.23) is contingent on the current 
actors of the market. Indeed, consumer surplus derives from the demand which itself 
sums the individual demands of economic agents standing ready to buy in that mar- 
ket. Similarly, the producer surplus sums the individual surpluses of active firms, those 
which are ready to produce and sell. When considering entry, we must extend our defini- 
tion of welfare to account for the resources that the entrant spends (or save) to enter, the 
most important being the sunk cost that must be incurred to gain access to the market or 
the fixed cost that has to be disbursed in every period. Recall indeed that none of these 
is not captured by our producer surplus definition. 1@ 

The cost function we adopt in this section derives from a decreasing returns to scale 
(DRS) production technology. This more complex specification is needed to treat sev- 
eral market structures simultaneously; it nevertheless contains, as a limiting case, the 
constant returns to scale (CRS) we use in most of the book. 

6.1.1 Entry under Perfect Competition 

Firms choose to enter a market for an homogeneous good at a fixed cost F that accounts 
for building a minimally efficient plant or buying a governmental license. Although 
unrealistic, we assume that, whatever the number of firms present in this market, they 
are price-takers i.e., there is perfect competition. 

Let us consider a common technology for all firms characterized by decreasing returns 
to scale and postive fixed cost: C{q) = F + cq + 8q 2 l2. A first method uses the theoretical 
result according to which the long-term supply of an active firm is her minimum efficient 
scale which itself determines the long-term price (cf §2.1.4). The former solves AC = C m 
and is q = \/2FI8 so that the latter is p = C m {q) = c + V26F. The limiting number of firms 
is then the ratio of market demand at the long-term price D{p) = a-bp by the individual 
long-term supply q. Solving this equation yields 2 ® 

n-^-bS (6.1) 

I The sustainable number of firms in a perfectly competitive market is determined by 
the minimum efficiency scale and the demand. 



The alternative method is to derive the full equilibrium in order to get the (same) 
condition for entry. The individual supply of a price-taker producer solves p = C m = 
c + 6q o q = ^ = s[p); his profit is 

n{p) = ps{p)-C{s{p)) = yp-c-s{p)^s{p)-F 
= ^ S {p)-F since £^=p^ 
= ^(p-cf-F (6.2) 

The aggregate supply when there are n active firms is S n {p) = n^. The competitive 
equilibrium price, equating demand D{p) = a-bp and supply, is p* n = ^§§ ', individual 
sales are 

tn = s(P*„) = £§ (6.3) 

and profits are 

K^^p*) = l(^f-F (6.4) 

Since profit is computed using economic costs (i.e., including the opportunity cost of 
every factor but excluding sunk cost), the relevant benchmark for a firm manager to 
decide whether to be active or not is zero. The limiting number of firms n* > 1 solves 
7T* > > 7T* +1 and is the integer part of n* computed in (6.1). 

The comparative statics are in line with intuition: a greater market size a favors 
entry, as well as lower entry cost F, lower price reactiveness b and lower marginal cost 
factor c. When the fixed cost F vanishes, the long-term number of firms increase un- 
boundedly i.e., DRS commands many small firms. If the technology approaches constant 
returns to scale (5 —> +oo), then the long-term number of firms tends to unity i.e., CRS 
commands a single large price-taking monopoly in order to minimize total fixed cost. 

In connection with our extended definition of welfare as W{p* ,q*) - nF in the pres- 
ence of n firms, one might wonder whether the efficient number of firms would not be 
that which maximizes this expression and not, as we previously did, that which equates 
(individual) producer surplus to fixed entry cost i.e., solves it = F. It occurs that both an- 
swers coincide. To check this claim, we study the effect of an additional firm on welfare 
W{p* ,q*). Firstly, there is the direct effect of having an additional firm whose producer 
surplus n contributes positively to welfare. Then, there is the indirect effect on welfare 
through the effect on the market equilibrium; in the case of perfect competition, the equi- 
librium price maximizes welfare which means that the change of price induced by entry 
has no first order effect on welfare i.e., the indirect effect is nil. 3@ The overall effect of 
entry on extended welfare is thus n-F which is the profit of the entrant. 



6.1.2 Entry under Bertrand Competition 

As always with Bertrand competition, we reach a paradox: fierce price competition yields 
a monopoly outcome although there is free entry. The key to understand this puzzle 
is to note that an incumbent monopoly already paid the fixed cost F while a potential 
competitor must ponder the opportunity to spend F. If there is Bertrand competition in 
the market and firms have identical constant marginal cost c, then the equilibrium price 
after entry drops to c. As a consequence, both firms earn zero producer surplus; this is 
dramatic for the entrant because it blocks him from recouping the initial fixed cost F. 
Hence, because competition is fierce no one dares to enter and the incumbent can enjoy 
monopoly profits! 

If, however, the potential entrant discovers a better technology, he will have a lower 
marginal cost c, will enter and force the incumbent to exit. Indeed, we saw in §5.2.1 that 
the cheapest of the two firms in presence wins the entire market and earns a margin 
equal to the cost difference, here c-c for the challenger. If the resulting profit {c-c)D{c) 
is larger than the fixed cost of entry F, then entry is unstoppable and will occur. The old 
monopoly is now excluded from the market which means that the entrant has become 
the new monopoly; this situation will last until a better technology is discovered and the 
same story repeats itself. 

The condition for unstoppable entry {c-c)D{c) > F is equivalent to an improvement 
in marginal cost greater than -^ « -^, since the monopoly quantity is roughly half the 
competitive one for linear demand and constant marginal cost. As a matter of compar- 
ison, the profit margin for the monopoly p M - c has to be greater than -^ to make his 
presence worthwhile in this market. Thus, the cost improvement must roughly be at 
least half of the current monopoly profit margin. 

From this analysis, we conclude that markets where price is the strategic variable 
and where production capacity is cheap to build (constant marginal cost) should be highly 
unstable; they are good examples of the Schumpeterian process of creation-destruction. 
When production capacities matter or equivalently, when marginal costs are increasing, 
things becomes more difficult to analyze as we already commented in §5.2 and a conve- 
nient competition framework is the Cournot one that we now proceed to study. 

6.1.3 Entry under Cournot Competition 

In this section, potential competitors share the cost function C{q) = F+cq + 8q 2 l2. If there 
is Cournot competition in the consumer market, the profit of one firm is jii{q-i,qi) = 



qiP{q-i + qt) - C{qi) and the FOC of maximization is 

R m = C m o a ~ q -f 2q ' = c + 8q t (6.5) 

leading to the best reply function q t = BR{q-i) = a ~2+sb l ■ ^ ne symmetric Nash equilib- 
rium among n active firms solves 

q = BR {{n -l)q)o q{2 + 8b) = a-bc-(n-l)q (6.6) 

and is 

C _ a-bc a-bc * (qh} 

°ln ~ Sb+n+1^ Sb+n ^n ^•" 

seen in equation (6.3). 

We immediately deduce that total supply nq„ is lower than in the competitive equi- 
librium (for the same number of firms), thus the equilibrium price is greater: p c > p* . 

More precisely, 

c a{8b+l) + nbc 

P » E bl8b + n + l) (6 ' 8) 

Finally, using equation (6.4) we obtain the individual profit: 

x C n = ^(Pn-c) 2 -F>^(p* n -c) 2 -F = n* n (6.9) 

using equation (6.2). We can therefore conclude after von Weizsacker (1980) that the 
limiting number of firms 

n c = s+ Vb { a _ bc _ by /2gp\ (610 ) 



is larger than n* since the lesser competition yields higher individual profits and allows 
for more firms to be active than would be efficient. Even in the limiting case of constant 
returns to scale (8 —■ 0), the Cournot price p„ tends to al ^X\ C an ^ remains above the 
marginal cost c, so that firms keep earning extraordinary profits or rents. 

I A market characterized by Cournot competition tends to be flooded by a too large 
number of firms charging too high prices. 

Notice that two opposite forces are at work; on the one hand, entry is welfare enhanc- 
ing through the increase of the entrant's profits but on the other hand, entry is welfare 
reducing through a "business stealing" effect: in reaction to entry, all incumbent firms 
strategically reduce their production. 

The intuition held for a long time in the economic profession according to which "free 
entry is conducive of a higher degree of efficiency" is in fact wrong. It is misleading to 



dissociate the issue of pricing from that of entry because both of them matter to assess 
the efficiency of a market. True, the entry of a new competitor increase rivalry in the 
market and leads to lower prices and greater consumer welfare but this is only one 
part of market welfare, our efficiency concept. There is an indirect effect of entry that 
was neglected in early studies, the reaction of incumbent firms to entry. If the pricing 
behavior of incumbents is such that, the reaction to new entry is to increase price or 
reduce individual production, then entry surely reduces welfare because the reduction of 
profits for the incumbents outweighs the gain in consumer surplus. Somehow, the entry 
of "bad" economic agents such as firms with market power can be a source of additional 
inefficiency. 

6.1.4 Hotelling Competition 

The Hotelling duopoly model of differentiation (cf §5.2.2) is extended to oligopoly in 
§11.1.3. This circular city model of differentiation is amenable to the computation of 
entry. When n firms are active, the equilibrium price is ^ and each firm serves her fair 
share - of the market earning profit n n = \-F where F is the entry cost. The limiting 
number of firms is then n = J p. 

Welfare, however, commands to minimize total cost made of fixed cost nF and trans- 
portation cost since the market is closed and each consumer buys a single unit. Now, 
there is a distance - between any two firms so that the maximum distance travelled is ;^- 

n J In 

(indifferent consumer located at mid-distance). Since people are uniformly distributed, 
the average distance travelled is half the maximum i.e., ^. The total transportation cost 
is thus C = 4^ and the FOC for minimizing nF + C has solution n* = J-^p = n/2. 

Consider lastly the case of a cartel among the n shops (cf. §9.1). To guarantee market 
coverage, the monopoly sets the price p M = p-^ that leaves exactly his opportunity cost 
to the consumer living at mid-distance (i.e., ^) between any two contiguous shops. The 
monopoly thus cares for the maximum cost £-. To decide on the optimal number of shops 
(or brands), the monopoly compares the fixed cost of setting up one more shop F with the 
increase in price ^ - 2(w f +1) that he will be able to apply. The condition for opening one 

more shop is n{n + 1) < jp. The approximate optimal number is thus n M = y jp- 

We may thus conclude that, in the Hotelling framework of horizontally differentiation, 
a monopoly opens some 41% more shops than what efficiency commands but that open 
access competition leads to an even stronger 100% excess opening. Once we make the 
alternative interpretation of transportation cost as opportunity cost for characteristics 
(cf. §11.2.1), we may say that competition leads to variety proliferation that even 



I monopolization would fail to stop. 



6.1.5 Generalization f 

In a more general setting of imperfect competition, Mankiw and Whinston (1986) show 
that entry is excessive if, in a symmetric equilibrium, the individual quantity decreases 
with entry. As we just saw, Cournot competition is one such case. To prove this result 
formally, denote q n and Q n the individual and total sales in a symmetric equilibrium 
with n active firms. The individual firm equilibrium profit is n n = q n P{Qn) - C{q n ), hence 

dn n dn n dq n dn n dQ n dq n n ,dQn to -.,* 

~^~ = ^ ^~ + ^7T^ — = (P-C m )—— + q n P — — (6.11) 

on oq on oQ on on on 

Market welfare in the presence of n firms is W n = nn n + W D {Q n ). We use equation (2.20) 
to write ^p = -Q n P'{Qn) - -nq n P'iQn) since q n - Q n ln in a symmetric equilibrium, thus 

dW„ dn n dW D dQ n dq n 

— — = n n + n—- + — — — — = n n + n{P- C m ) ~^-<n n (6.12) 

on on oQ on on 

using (6.11) since we assumed -^ < and a rational firm adjusts its production (may be 
using rationing) to maintain the price above its marginal cost. 

We see in equation (6.12) the two effects alluded before, the (positive) entrant's profits 
and the (negative) "business stealing" effect. Entry will occur until n n = is satisfied 4 ® 
while it ought to stop sooner when ^ = is satisfied. All in all, there is excess entry. 

The same idea can be pursued even when products are not perfect substitutes i.e., in 
the presence of product diversity. It is however necessary to adopt a convenient formu- 
lation of consumer surplus like the one proposed by Spence (1976a). Letting q = {qt)i< n 
be the production of the n firms, the gross consumer surplus (net of price paid to sellers) 
is W D {q) = <D (Xj£(<7;)) where O and £, are concave increasing. Total welfare in a symmetric 
equilibrium is now W = Q>{nt;{q n )) - nC{q n ). Surplus maximization leads consumers to de- 
mand a quantity qi of variety i such that pi - ^^ = O'^', hence the profit of a firm in the 
symmetric equilibrium is n n = &£,' q n - C. Now we get 

<D'£-C+(0>'£ -C m )n- 



dn " dn 

= n n + {<!>'S' -C m )n-§± + <&$-$' q n ) (6.13) 

on 

We observe that (6.13) is similar to (6.12) but with an added term, the last one, whose 
sign is positive by the concavity of £. It simply means that because consumer like variety, 



entry contributes not only to competitiveness but also to increase the desired variety 
thus is not as bad as in the case of homogeneous products. In this generalization, no 
definite conclusion can be reached unless we make some further assumptions regarding 
the pricing behavior. Roughly speaking, if firms are close to be price-takers then diversity 
is insufficient while it is excessive when firms have significant market power. 

6.1.6 Market Integration 

The political acceptation of an economic integration process such as the European one is 
strongly dependent on the effect it may have on national market structures and national 
welfare. One element related to the present theory of oligopoly competition is whether 
we shall see more or less active firms in the long run once integration has taken place. 

Quantity Competition 

To resolve this question consider two national markets for the same homogeneous good 
where national firms compete in quantities and share the same production technology 
displaying decreasing returns to scale (DRS) and a positive fixed cost of entry. Before 
studying integration, we have to look at the autarchic long-run Cournot equilibrium in 
each country. Autarchy is the relevant hypothesis because import taxes or import quotas 
in each country prevent foreign firms from participating in national markets. 

The long-run Cournot market structure in a single country is a number n of national 
active firms. When integration takes place, each market has suddenly 2n active firms so 
that the equilibrium price falls in both markets. If we assume that demands are identical 
in both countries (i.e., economies are of similar size), the profit of an individual firm is 
2n c - F since it is now active in the new economic area consisting of both countries. 5 ® 
The entry condition being now n c > F/2, the long run market structure of the integrated 
economic area is that of a (new) single country where the fixed cost of entry would be 
twice smaller. The integrated economic area therefore offers more room for firms but not 
twice more i.e., the long-term number of firms m satisfies n<m< 2n. 

In the example we have been using to study oligopoly in §5.1, the long term number 
of firms in a single country is the solution of n c = F where, for simplicity, we neglect 
the integer problem. Using equation (5.12), we derive n = ^^ - 1. The post-integration 

VbF 

stable number of firms solves, on the other hand, n c = F/2 which yields 

m _ sf^g^fc _ l = n + (v /2_ X ) azbc <n+ ^_ l)n = nv /2 

VbF VbF 



which is more or less 70% of 2n, the initial number of firms in the integrated economic 



area. Hence, some 30% of the original firms will have to exit the integrated market. 6 ® 

Perfect Competition 

Let us compare this result to what happens when both national markets are perfectly 
competitive. We know that previous to integration, the autarchic long-term equilibrium 
in country A features firms which are all selling at their minimum efficient scale c{a 
(the level of production minimizing the average cost) while the price is pA = C mi AiqA) = 
ACa{cjo)- The long-term number of active firms is then given by the equality of demand 
and supply i.e., ha = D a {Pa)I^a- The same goes on in country B. When the two economies 
merge, either everything remains the same because prices were already equal or there is 
a comparative advantage for one country, say A, if Pa< Pb- In that case, the production 
technology of country B become obsolete and all firms using it have to close gradually 
since the long-term price is Pa which is unsustainable for them. To meet the demand 
of both countries with the better technology (A), new firms will enter the market; there 
is obviously no condition of nationality for them since our simple model has neither 
administrative nor transportation cost to limit entry within the integrated area. 7@ 

Summing up, economic integration of perfectly competitive market enables all con- 
sumers to take advantage of the best available production technology while the supply 
side of one country may need to reorganize itself to meet the standard set by the other 
(currently better) country. The fact that the better technology leads to greater sales 8 ® 
means, as a consequence, that more inputs will be used in the integrated economic area 
as compared to what was previously used in the autarchic countries. The geographic 
origin and composition of these inputs (e.g, unqualified labour, qualified labour or capi- 
tal) cannot be assessed in such a crude model. However, the origin of the comparative 
advantage we alluded to before can give a hint of which input is most needed in the econ- 
omy; the most frequent sources of technological advantage are the wage level, the cost of 
capital and the level of human capital of national workers (productivity). 

Conclusion 

Whatever their initial market structures, the integration of two economic areas has a 
globally beneficial effect since the long-term number of active firms increases in both 
countries which means that the price will fall everywhere thereby generating a greater 
consumer surplus and also a greater total welfare. This is so in the Cournot model 
because the fixed cost of each remaining firm is spread over larger sales thanks to the 
duplication of the market size. To sum-up: 



Economic integration is good for all sectors even-though some firms belonging to inef- 
ficient oligopolistic markets will have to close down; their factors, capital and labour 
then find employment in the more competitive sectors of the economy where the inte- 
gration process creates an additional demand for factors that favors either the creation 
of new firms or the expansion of existing ones. 

There is few doubts that the early economic integration of the US states during the 
20 th century contributed meaningfully to the might of the US economy. A likewise ob- 
servation can be made of the European integration after the second world war. 

6.1.7 Contestability 

Baumol et al. (1982)'s theory of contestable markets present an ambitious theory of in- 
dustrial organization which breaks away from the SCP paradigm but also with the core 
methodology of this book, strategic behavior. 

We first summarize the theory before succinctly presenting its formal aspects. Next, 
we turn to its theoretical shortcomings and finally to the lasting impact it has shown in 
the policy arena. 

Basic Tenet 

Let us start with an historical clearcut example. The Ballpoint Pen was patented in 
1938. In 1945, Milton Reynolds bought one and copied the idea bypassing the patented 
capillarity mechanism with a slightly less efficient gravity one. By doing so, he beat 
the official distributor to the US market and sold his "Rocket" pen as a luxury item, 
generating large profits. Within one year, all incumbent pen makers had entered the 
market pushing the price down from 12$ to 3$. With even more challengers, the price 
dropped to 40 cents in 1948, so that Reynolds cleverly sold all his assets in 1951, cashing 
on his initial success. 9 ® 

Contestability is most best understood as an extended Bertand paradox (cf. 5.2.1): 
if an incumbent prices so as to obtain an 8% rate of return on investment whereas the 
market average is 6%, then a potential entrant can actually enter the market at a lower 
price, capture all demand and still make a 7% return that keeps beating the market. 
This situation will last until the incumbent is able to react with an even lower price to 
recapture some market share. Even if he ends-up being expelled from the market, the 
transitory entrant has managed to earn an above average return for some period of time; 
this, in itself, warrants the initial entry. The incumbent being equally rational, cannot 



ignore the menace. In other words, meanwhile she does not adjust her pricing strategy 
so as to yield the average return on investment, "hit and run" will continue to take place. 
To understand the potential extent of contestability theory, it is worth recalling that 
perfect competition does not cover the case where a technology displays economies of 
scale or scope (cf §2.3.1). Thus, the welfare theorems do not apply for natural monop- 
olies and fail to provide us with an efficient benchmark for policy prescription. Filling 
this void has been hotly debated since the 1930s. Tenants of SCP call for regulation as 
exposed in §17.1 while the Chicago school seeks to axe all barriers to entry i.e., create the 
conditions for free entry in order to unleash the forces of potential competition. 10@ The 
theory of contestable markets formalizes this latter point of view by generalizing per- 
fect competition to cover natural monopolies and more generally, industries displaying 
economies of scale. The key features of a contestable market are free entry and the exis- 
tence of at least one potential competitor. The equilibrium in such a contestable market 
is then extremely efficient: 

• If technology calls for a natural monopoly, there is one active firm and she prices her 
bundle of products in a Ramsey-Boiteux fashion 11@ so that second best efficiency is 
achieved. 

• If technology calls for two or more active firms, then the number of firms minimizes 
industry cost; furthermore these firms price all their goods at marginal cost so that 
first-best efficiency is achieved i.e., "two is enough for competitive outcomes". 

The policy implication of these results is obvious: if a market is contestable, no regu- 
lation at all is needed because the hidden forces of potential competition literally compel 
incumbents to behave efficiently. If the market is initially regulated, it should be de- 
regulated. As we explain hereafter, the contestability theory is too thin to support such a 
call but other models of industrial organization have shown that administrative barriers 
to entry that protect incumbents have negative welfare consequences and should thus 
be removed. 

Theory and Critique 

The main academic achievement of contestability theory is the characterization of effi- 
cient cost structures for a multi-product production technology; as reported by Baumol 
and Willig (1986), it has been used in many econometric studies of regulated oligopolis- 
tic industries. Since this topic is intensive in mathematics, we skip it and focus instead 
on the implications of contestability for competition and market performance which are 
almost formula free and can therefore be exposed here. A contestable market satisfies 
the following hypothesis: 



• HI: The best incumbent's technology is available to anyone. 

• H2: The profitability of entry is evaluated at the current price. 

• H3: Fixed costs are not sunk. 

(HI) implies that the product sold by active firms is homogeneous since they all share 
the same production technology. It also implies that incumbents derive no quality ad- 
vantage from the positive experience of actual clients, nor goodwill from their histori- 
cal presence in the market. (H2), the Bain-Sylos postulate, means that a potential en- 
trant completely disregard the reaction that his entry will trigger among incumbents. 12 ® 
Lastly, (H3) means that although a fixed investment may be necessary to enter, the en- 
trant can hit-and-run the market at no cost; his capital investment is not specific to this 
market and keeps its full value for use in another market. 

The next fundamental ingredient to the theory is the ubiquitous presence of a poten- 
tial competitor, in the background, ready to "hit and run". 13@ When (H1-H3) are satisfied, 
it is quite easy to see that incumbents, whatever their number, are forced to price the 
product at their long-term average cost i.e., as if they were perfect competitors. No reg- 
ulation is needed to guarantee efficiency since firms, including natural monopolies, will 
(auto) regulate themselves towards that goal. Contestability is thus a logically straight- 
forward theory building on clear-cut assumptions. As with all economic theories, the 
latters are not entirely fulfilled in real markets: 

• HI: A potential competitor need not exist because the incumbent's technology is not 
freely available; rather technologies are protected by patents, regulatory capture 
through lobby groups, or simply experience and know-how. 

• H2: The Bain-Sylos postulate is turned upside down by firms' quickness to react 
to a changing environment. It is customary to see a major player reacts within 
days or weeks with a price cut, an improved warranty, a special gift or an adver- 
tising campaign to counter the entry or launching of a new competing product or 
service. 14 ® 

• H3: There are virtually no investment that can be freely moved from a market to 
another, hence some of the fixed costs are sunk even if a small share. 

Weak empirical contrasting is however not an impediment to the success of a the- 
ory. The perfect competition paradigm (PC) is a polar framework that never fully ap- 
plies to real markets; its fame derives from its ability to serve as a benchmark. Indeed, 
economists have been able to relax the strong hypothesis of PC to construct models of 
imperfect competition and compare them to the PC benchmark. So, we know better 
the contribution of each PC hypothesis to the welfare theorems and this is also useful 



for policy recommendations. Such an extension process has not taken place within the 
theory of contestability. 15 ® Whereas nearly competitive markets have nearly efficient 
outcomes, Baumol (1982) confess that nearly contestable markets might have nearly 
efficient outcomes. Contestability theory is thus inflexible and not amenable to change 
and progress. 16@ Lastly, contestability assumes away strategic behavior (H2) in order to 
achieve logical simplicity. By going contrariwise to the general development of scientific 
thinking in the field of economics, it has put itself outside the paradigm of modern eco- 
nomics. 17@ Nonetheless, this vision has made its way into the policy arena and for that 
reason cannot be altogether ignored. Note finally that the modern use of the terminology 
points at models of entry barriers based on game theoretical reasoning. 

Impact 

The sharp conclusions of contestability have been adopted by lawyers defending firms 
standing accused in antitrust cases starting with its architect William Baumol, a long 
time consultant for AT&T. 18@ The following quote by economics professor Richard Schmaler 
illustrates this. Upon being hired by Microsoft in the antitrust case over the tying of the 
"Internet Explorer" browser to the "Windows" operating system, he declares that "the 
extremely high market share of Microsoft does not translate into market power because 
the company faces the threat of potential competition. Of great concern to Microsoft is the 
competition from new and emerging technologies, some of which are currently visible and 
others of which certainly are not. This places enormous pressure on Microsoft to price 
competitively and innovate aggressively" (cf §24.1). 

The "invisible technologies" alluded to in the previous statement are a reality; the 
history of the software industry is replete with case of sudden death of an incumbent 
upon appearance of a better product launched by a challenger. Microsoft which itself 
successfully outplayed competitors is therefore right to be anxious about this treat. Yet 
the response does not seem to be aggressive pricing 19 ® but rather the preemptive buying 
of small firms that have developed and patented new and interesting innovations. 20 ® 

As we indicate in §24.3.5, Microsoft achieved a quasi monopoly with its software "Mi- 
crosoft Excel" and "Microsoft Word" at the end of the 1990s; however, after the unin- 
terrupted price fall of the 1990s it did not raised its price as we would expect from a 
traditional monopoly. That is an indication of the seriousness of the threat alleged by 
Schmalensee. However, Microsoft products remain more expensive than the competitive 
fringe and have a rather short live span due to the Microsoft policy of issuing new ver- 
sions that force users to upgrade (the backward compatibility is of low quality). This 
strategy can be then seen as the optimal policy of a monopolist selling a durable good (cf. 
§4.3.5), all the more since "Office XP" has been marketed as a leasing product in several 



countries instead of being sold. 

The very idea of contestability is also related to Markides and Geroski (2004)'s "fast 
second" notion whereby the ability to copy a first mover may save on cost of development 
while losing little in terms of image and thus be overall advantageous (cf. Lee (2009)). 
An example is how MySpace was gobbled up by Facebook and Twitter in 2009. 

6.2 Stackelberg Leadership 

We now study how a firm can weigh in a profitable manner on the terminal stage of price 
or quantity competition by taking an observable and sometimes costly decision which 
irrevocably changes her behavior. 

6.2.1 Intuition 

Stackelberg (1934) observing the competition among mineral water producers in Ger- 
many notices that one firms seems to be a leader whose changes of prices systematically 
trigger responses by all other contenders. To explain and rationalize such a sequentiality 
of moves, this author modifies Cournot (1838)'s model of quantity competition. 

Keeping in line with our previous farmers' story imagine that farmer 1 takes a lead- 
ing action by announcing publicly the quantity q\ it will bring to the market (e.g., pub- 
lication in the local newspaper). Farmer 2 who forgot to do the same becomes a follower 
and its best response to the announced q\ is to bring q 2 = BR^iqi). It is crucial to under- 
stand that farmer 2 cannot credibly lead farmer 1 or anyone else to believe that he will 
bring a quantity different from BR^iqi). 

The behavior of farmer 1, in this new context, differs qualitatively from that of farmer 
2 as he can anticipate that his announcement c\\ will be followed by BR^iqi). A first and 
obvious result is that farmer 1 will be better off than in the Cournot equilibrium. Indeed 
he can choose q^ in which case farmer 2 replies with q£; thus farmer 1 obtains at least 
the Cournot profit U^. We now study his incentive to deviate from the Cournot value. 

Recall first that farmer 2's reaction is decreasing with the commitment c\\ (this is 
the basic property of Cournot competition). In the Cournot framework, when farmer 1 
increases his sales by, say 50 kg, market sales grows by 50 kg forcing the market price to 
fall by, say 8€. The optimal level is then when the additional profit made on the 50 kg 
just equates the margin loss of 8€ over his total sales. In the commitment framework, 
when farmer 1 increases his sales by 50 kg, farmer 2 reduces his own sales by, say 25 
kg in response; hence, market sales grows by 25 kg only and the price falls by a smaller 
amount, say 5€. This is attractive for farmer 1 because the profit made on the 50 kg 



is as before while the margin loss over his total sales is 5 instead of 8 per kg, thus the 
additional sales are profitable under the commitment while they were not under Cournot 
competition. 

The usual problem with increasing sales for firms with market power is that such 
a move depresses the price and thus the profit margin. The value of a commitment to 
large sales is to force the challenger to reduce his own sales in a "desperate" attempt to 
detain the price tumble provoked by the commitment. The leader is thus shifting part of 
the problem on the challenger and this is why increasing sales can be profitable when it 
would not be under a regime of simultaneous Cournot competition. 

6.2.2 Formal Analysis 

Formally, the merit of the commitment is seen by exploring in detail the marginal rev- 
enue of farmer 1. Indeed, the commitment affects only the behavior of the other farmer, 
thus the demand side and not the supply side (cost function). In the Cournot framework, 
the quantity sold by farmer 2 is, from farmer l's point of view, a parameter insensitive to 
his own sales decision while in the commitment framework, the quantity sold by farmer 
2 is a response to the commitment, we thus have 

R m>1 = P + P'q l + P'^ = R m ,i + P'^- 
dq 1 dq± 

because he knows that his decision q\ will affect the decision q 2 of farmer 2. As q 2 = 
BR^iqi), we deduce -S- < so that the marginal revenue of additional production is 
greater than under the Cournot competition; hence there is a motive for increasing pro- 
duction which implies that the other farmer will have to reduce his own. Nevertheless it 
is not clear whether total sales will increase or not. 

In our linear example, the relation between sales q\ and price becomes 

a-qi-BR^{qi) _ g+bc-qi (R-\A\ 

P- b ~ 2b KO.Xt) 

using (5.4): the demand addressed to farmer 1 is twice more elastic than in the Cournot 
game; this leads farmer 1 to produce a greater quantity (as we already saw). Equiva- 
lent^, the profit to take into consideration for farmer 1 is not rii [q\,q 2 ) where a value of 
q 2 has to be guessed but 

Yli[qi,BR%(q\)) = q x {p-c) = \q\[a-bc-q\) (6.15) 

using (6.14). The optimal choice, qf, maximizes (6.15); it solves the first order condition 
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which in turn leads farmer 2 to respond with 

_S _ H D C r S-> _ a-fcc ^ _C _ a-bc 
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It is a matter of algebraic computations to check that the Stackelberg equilibrium price 
is p s = a+ l£ c < p c = a+ ^ c , the Cournot price so that efficiency unequivocally increases 
(total consumption increases). Regarding equilibrium profits we obtain 

j-rS _ J_ {a- be) 2 < j-rC _ 1 (a-bc) 2 _ j-rC y\ s _ 1 (a-bc) 2 
2l6b 2gfo 1 l8i> 

so that there is indeed a first mover advantage but also 

u S + n S = ^(a z ^ < 2(a z bcf =n C + u C 

meaning that industry profits decrease by 2I9 3 ^ 16 - 18% which does not come as a sur- 
prise since the price is lower. 

To conclude, the flooding of the market by farmer 1 reduces the cake to be shared by 
18% but enables him to grab two thirds instead of one half, so that he ends up better by 
12% but farmer 2 loses 44% of his original profit. 

6.2.3 The value of Commitment 

Time Consistency 

The Stackelberg model of leadership (and the first mover advantage result) although 
appealing has a serious drawback because it is time inconsistent: at the moment where 
farmer 1 has to take his quantity decision he is tempted to renege his word (the an- 
nouncement he made previously). Indeed, when facing g|, his best reply is q[ = BR^iq^) = 
3«z_k_£ w hich is between q^ and qf (check it on Figure 5.1 by drawing qf, then the best 
reply q\ and finally the best reply q[ to q^). In simple words, if 60 is the Cournot quan- 
tity, farmer 1 announces 80 to force farmer 2 to reduce his own sales from 60 down to 
40 but once this is achieved, farmer 1 would like to sell only 70! The reason for that is 
quite simple: (60,60) is the only stable pair which means that (80,40) is unstable in the 
sense that at least one of the farmer prefers a different choice. Now, since 40 is optimal 
vis-a-vis 80, it must be the case that 80 is not optimal vis-a-vis 40 and this is exactly what 
is happening here; in front of 40, the optimal response is 70. 



Legal System 

This paradox (and its solution) can be understood with the help of Figure 6.1 below. 
Farmer 1 plays first by announcing a quantity. We see on the lower branch that q^ is 
followed by the optimal response q£. If now farmer 1 announces the large quantity qf 
then we expect farmer 2 to respond with the smaller q\. After observing the irrevocable 
choice q\, farmer 1 would like to revise his quantity down to q[. What can possibly happen 
in that case? It could be the case that Farmer 1 is sued by angry rationed consumers and 
condemned by the judge (J) for lying in the newspaper. If the penalty is large 21 ® than 
then it is optimal to keep the promise and bring the quantity qf to the market, so that 
in turn it is optimal for farmer 2 to bring q%. 

' i s 

^2 Revise qi < qi ^-^ ^ Penalty 

uK 

^Nothing 




(lj — Stackelberg outcome 



Cournot outcome 

Figure 6.1: Sequence of Announcements 

Consider now a twist of our basic story where the advertising says in very small 
characters that "the announced quantity is non binding obligation". The judge, when 
called to act, would turn down the consumers because the advertisement only referred 
to an intention, so that no malevolence is involved. Reassured by this judgment, farmer 
1 would rationally choose q[ over qf Anticipating that the announcement was just some 
kind of advertising (economists say "cheap talk") farmer 2 would therefore bring q' 2 = 
BR^iq'^. We have just started the process of optimal revisions that bring the two farmers 
back to the Cournot equilibrium. 

Interpretation 

Our little digression within the judicial system tells us that the Stackelberg model ap- 
plies only to situations where the first mover can make sure that he would suffer if he 
were to change his mind later; in other words, the first mover must be able to tie his 
hands to convince the follower that what he said will indeed happen. The public an- 
nouncement is a device to commit to something that would otherwise be untenable. 

Notice that the existence of the judicial system is necessary to enforce the Stackelberg 
equilibrium for otherwise the equilibrium is the Cournot one. The paradox is that in 
equilibrium, farmer 1 always keeps his promise so that the judge is never call to act. 



Society is thus paying judges for no actual work. It is precisely for this reason that the 
judicial system has to be publicly financed; being inactive it cannot sell its service and 
cannot therefore recoup its cost. 

Nonetheless, trade which is the basis of market economies relies on morality, trust 
and an effective commercial law. It is quite simple, using Figure 6.1 in a more general 
context, to understand that the inefficiency and corruption of judicial systems in develop- 
ing countries is undermining the whole trading system; this issue of "good governance" is 
now recognized as a major impediment to growth and actively promoted by international 
organizations like the OECD or the World Bank. 

6.2.4 Business Strategies f 

The idea of strategic commitment "a la Stackelberg" is extended by Fudenberg and Tirole 
(1984) and Bulow et al. (1985) to study the global effect of an investment A;i in capacity 
or advertising by firm 1. This firm competes on the consumer market against firm 2. The 
strategic variables are X\ and x 2 (e.g., prices or quantities). The profit of firm 2 depends 
obviously on the strategic choices in the consumer market but also on the previous choice 
k\ of her opponent so that we write it n 2 (A;i,X2,xi). 

In the last stage, both firms are rational, thus choose x\ and %2 to form a Nash equilib- 
rium of the situation where they fight, a situation that is potentially affected by k\. Hence 
the equilibrium is (xi(Ar 1 ),x 2 (A; 1 )) such that X\{k\) maximizes Ui{ki,Xi,Xj{ki)) for i = 1,2. We 
have 
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Regarding the direct effect we can distinguish two cases: 

• if fci is spend on advertising to increase fidelity of consumers then jh. < as the 
potential market for firm 2 is reduced. 

• if ki is a technological investment aimed at decreasing firm l's cost or improving 
quality it has no direct effect on firm 2 thus -^ = 0. 

The indirect effect is called strategic because it takes time to act: the decision of 
today, k\, will force firm 1 to change her behavior tomorrow. We say that the investment 



k\ makes firm 1 look tough (resp. soft) when the strategic effect is bad (resp. good) for 
firm 2 i.e., if g§| < (resp > 0). 

We now ponder the indirect effect of ki over firm l's own attitude by looking at the 
decomposition of ^- into a direct effect §p and an indirect one j^-^r to observe that 
whenever the strategic term is negative (resp. positive), firm 1 has a tendency to under- 
invest (resp. over-invest). 

We assume that the strategic variables x\ and x 2 are both prices or both quantity, 
more generally that -^ and -^ have the same sign. Using gp = -^ x j^- we obtain 
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We say that the strategic variables x\ and X2 are strategic substitutes when -^ < 0, 
as it is the case for Cournot competition. We speak of strategic complements whenever 
j^- > 0, as it is the case for Bertrand competition. 

We can now draw, using the colorful language of Fudenberg and Tirole (1984), a clear 
taxonomy of the strategic effect of the investment k\ for the leading firm 1: 

• If k\ makes firm 1 look tough and any increase in her strategic variable forces firm 2 
to reduce her own then firm 1 desires to overinvest to gain an additional advantage 
(beyond the inner value of the investment) i.e., be a "top dog". 

• If ki makes firm 1 look tough but compete with strategic complements then it 
prefers to underinvest to avoid tough competition later; it mimics a "puppy dog". 

• If fci makes firm 1 look soft and it competes with strategic substitutes then firm 1 
desires to underinvest or look like an "hungry dog" in order to show her readiness 
to fight in the last period. 

• If k\ makes firm 1 look soft but compete with strategic complements it prefers to 
overinvest to reduce her future aggressiveness, it mimics a "fact cat". 
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Table 6.1: Taxonomy of Business Strategies 



Beware that this taxonomy is limited to oligopoly and excludes monopoly and per- 
fectly competition. Indeed, the idea developed here rests on the indirect effect that a 
particular behavior has over a competitor's market strategy. 

6.3 Forward Market 

A simple way to commit to a large production in the future is to make some anticipated 
sales. This can be done using a forward contract which is a promise to deliver a fixed 
quantity at a future date in exchange of a monetary payment. Since forward trading is a 
pervasive instrument to protect oneself from risk, we briefly recall its functioning before 
studying the relation between forward contracting and Stackelberg leadership. 

6.3.1 Introduction 

Forward trading is either organized in a dedicated forward market, an exchange 22 ® or 
through bilateral transactions deemed "over the counter" (OTC). A party to a forward 
transaction adopts either a "long" position if she promises to buy or a "short" position 
if she promises to deliver (sell). 23 ® Since the main use of forward trading is to hedge 
the risk of a price variation in tomorrow's spot market, one might as well use a purely 
financial instrument called a "contract for difference" (CdF) that pays the (algebraic) 
difference between today's agreed price and tomorrow's price on the spot market (for an 
agreed quantity); that way a seller or buyer can guarantee himself a fixed price for the 
good he will transact. 24 ® 

When a trader who is not a producer holds a short position, a promise to deliver, he 
has no other choice but to buy the units in the spot market at the equilibrium price, be 
it high or low. Likewise, when a trader who is not a consumer has a long position, he 
has no use for the physical unit he is entitled to receive, thus he prefers to sell it in the 
spot market rather than keep a useless item, this whatever the standing price. Matters 
are different for producers holding forward positions since they can actually produce the 
units they must deliver or save those they will receive by virtue of their long position. 
We shall demonstrate later that a producer gains no advantage from this feature because 
the equilibrium is the same whether he behaves as a pure trader or not. 

6.3.2 Analysis 

Although forward markets have long been studied in finance, they were seen from a 
theoretical point of view solely as "hedging" instruments, a method to spread risk among 



many agents and therefore reduce its impact on individuals. Allaz and Vila (1993) offer 
a new rationale by showing that forward markets are also useful to reduce the market 
power of large participants in spot markets. 

Let us demonstrate this result using the Cournot model of quantity competition for 
an homogeneous good (cf §5.1). The spot market where the product is sold and bought 
is preceded by a forward market where anyone, firm, buyer or broker, 25 @ can buy or sell 
forward contracts; an algebraic position / in the forward market generates an algebraic 
gain (t - p)f to its holder where p is the spot market price and t is the forward market 
price. 26@ Hence / = 1 is a unit long position, a promise to deliver later while / = -1 is a 
unit short position, a promise to buy later. 

We assume without loss of generality zero production cost and denote ft the position 
of firm i = 1,2 in the forward market. When /! + f 2 > i.e., producers are globally long, 
some counter-party traders are entitled to receive physical units of the good; since spot 
market buyers are the only ones who value these units, they will end up buying them in 
the spot market. As a consequence, their overall demand D{p) will be met by the balance 
of forward contracts /! + f 2 and by the sales of producers in the spot market X\ + x 2 . If 
we denote qi = Xt + ft the total amount to be produced by firm i = 1,2, then the spot price 
satisfies D{p) = q x + q 2 o p = P{qt + qj) so that the total profit of firm i made up of spot 
sales and forward sales, reads %i = P{q t + qfiiqt - ft) + rfc. 

The marginal revenue of an additional unit produced (independently of where it will 
be sold) is -£ - P + {qi - ffiP'. It is important to observe the positive effect of forward 
sales (short position or / ; > 0) on marginal revenue. The negative term is smaller than 
in the Cournot model (no forwards) because the rebate that must be offered to current 
customers in order to gain an additional customer applies only to spot market sales, not 
the entirety of production (check the difference with equation (2.15)). Using our usual 
linear demand D{p) = a-bp, the best reply solving -^ = is easily computed as qt = ^-4? — '- 
which is increasing with /} thereby confirming our intuition that forward sales enable to 
commit to a larger overall production. Using the symmetric equation for the other firm, 
we can solve the equilibrium system and derive equilibrium quantities qt = J 3 + for 
i = 1,2; after simplifications, we find the equilibrium spot price p = 3 J b and the total 
sales Q=^^. 

As always in financial markets there are pure traders (brokers) in the forward market 
willing to exploit any arbitrage possibility. If the aggregate position of producers is long 
(/i + h > 0)> they expect a profit over the forward units if t > p, thus their counterparts 
expect a profit if r < p. We deduce that brokers are interested to take the short position, 
pay t now only if t < p. In that case they compete a la Bertrand over this opportunity 
which means that they will offer an higher and higher forward price t; in equilibrium 



there is equality. A similar reasoning holds when the aggregate position of producers is 
short, hence the spot price is perfectly anticipated by arbitrageurs so that t = p holds. 

In the initial stage where the producers have to decide on their forward sales, they 
take into account the future equilibrium of the spot market. Spot sales and forward sales 
are therefore made at the same price so that total profit is production times spot price: 

Krtfi.fj) = Pqi = ^{a-fj-ftKa-fj + 2fi) (6.18) 

The best reply to /) is easily computed by maximizing (6.18). The solution to © = is 

ft = ^-p-. Since individual forward sales are limited by the market size a, each firm will 
sell forward (/} > 0). The underlying reason is that quantities are strategic substitutes 
(cf §6.2.4), thus the (positive) strategic effect of anticipated sales is stronger than the 
(negative) direct effect of lowering the spot price. 

The equilibrium of the forward market is found by solving for the system of best 
replies; we obtain f t = fj = f out of which we compute qi = qj = ^f i.e., each firm sells | 
forward and then another | in the spot market. The equilibrium price at which every- 
thing is bought is p = fj- < p c = ^, the Cournot price (for zero cost), hence total sales 
increase as well as efficiency (recall from §2.3.2 that higher consumption is synonym of 
higher welfare). 

To summarize, the existence of a forward market with perfect foresight and arbitrage 
intensifies quantity competition and brings about more efficiency. Because forward sales 
are beneficial to a single firm, they enter into a race for selling in advance; this situation 
is very much like the prisoner's dilemma: it would be better to avoid forward sales, but 
if you are the only one to do so, it becomes a worthwhile option! (cf. §2.4.1) 

6.3.3 Comparisons 

The relation of forward sales with Stackelberg leadership is now quite simple to see 
(using the physical forward sales contract). If firm 1 is the only one who can sell forward 
then she chooses /i = § (the best reply to f 2 = 0), so that firms behave as in the Stackelberg 
game. Indeed, it is a matter of simple algebraic computation to check that q\ = qf and 

qi = qt- 

We have thus found a new way to commit to sales of qf = f . Firm 1 can sell im- 
mediately half of it and then compete in the spot market to (optimally) sell the other 
half. Thus, we can say that the commitment to bring a large quantity to the market 
or the forward sale are identical tools if only one firm has access to them. However, as 
soon as the two competitors can both commit or both sell forward, there is a difference 
among the two instruments. In the case of double commitment, it is as if there was no 



commitment at all, so that the Cournot outcome emerges again. On the contrary, the 
case of mutual forward sales represent a stronger commitment than the mere promise 
to fuel the market; both firms actively engage into this practice and get trapped into an 
unwanted flooding of the market that ultimately reduces their profits but increases the 
overall efficiency. 

We have to conclude with a word of caution spelled out by Mahenc and Salanie (2004): 
the social desirability of forward markets is crucially dependent on the mode of competi- 
tion in the spot market. As we explained above, forward sales always end up in the hands 
of buyers, thus reduce the demand appearing in the spot market and therefore depress 
the equilibrium spot price. When firms compete in quantities which are strategic sub- 
stitutes, it is more important to increase sales using forward contracts than sustain the 
spot price. On the other hand, when firms compete in prices which are strategic com- 
plements, the reverse property holds: firms prefer to act so as to sustain high prices 
even if this means selling less. They therefore take a long position (buy some of their 
own production) but do not go as far as selling their entire production in advance. 27 ® 
This might seems unnatural on the part of a producer. When such long positions were 
taken in 1978 by large coffee producing countries, observers thought it was an attempt 
to squeeze the traders holding the short positions; indeed, the latter might be forced to 
accept very high prices in the spot market in order to meet their obligation to deliver the 
physical good (since they will hardly be able to procure it elsewhere). The present theory 
shows that this might well have been a strategic behavior aimed at taking advantage of 
the countries market power where that term is understood in the classical sense and not 
as an exploitative behavior (cf sec. 4 in Mahenc and Salanie (2004)). 

6.4 Vertical Relationship 

In this section, we look at several strategic commitments. The first two examples look 
at the authority relationship with managerial compensation and manufacturer-retailer 
trade. We show that a relation governed by contract (aka "arm's length relationship"), 
can provide an advantage over integration where the relationship is governed by author- 
ity. Obviously, we leave aside the possible benefits of integration which are treated in 
§14. Lastly, we show how firms can use debt leverage to commit to a greater aggressive- 
ness in the market. 



6.4.1 Managerial Compensation 

The rational owner of a firm strives for profit maximization but she has to rely on a 
manager to achieve this goal. The latter then does not seem to care much for profits but 
rather for the size of the firm he runs (and the perks associated with the position). §13.1 
shows how manager's objectives can be re-aligned with the owner's views. §15.1, on the 
other hand, presents some arguments as to why excessive firm growth is inevitable. 

The originality of Vickers (1985) and Fershtman and Judd (1987) is to argue that, 
may be, it is in the interest of owners to have their managers maximize sales or revenue 
instead of profits. The reason is that most firms are neither monopolies nor perfect com- 
petitors (price-takers), hence their behavior influences the behavior of their competitors. 
In a context of Cournot competition, there is a first mover advantage to flood the market 
because it forces opponents to reduce their own sales. To achieve that advantage, it is 
enough for the owner of the firm to write a managerial compensation linked to sales (or 
revenue). 28 ® Because such a bright idea is immediately adopted by all firms, they be- 
come trapped in a prisoner's dilemma. Indeed, the new equilibrium is characterized by 
higher sales, a lower price, a higher welfare but also lower industry profits. 

To check this claim, assume that the traditional compensation package made of a 
base wage w and a share A of profits (bonus) is enough to align managers incentives 
with the maximization of profit. Consider now an alternative compensation scheme 29 ® 
that additionally pays a share ju of the firm's revenue R: 

w = w + Xn + fj,R = w_+ (A + /j.) {R - aC) 

where a = j£- since n-R-C. It is obvious that the manager will act (choose a quantity) 
so as to maximize R - aC. The owner can therefore tune a to obtain his desired strategic 
effect and use the other parameters to offer (on average) his opportunity cost to the 
manager (saturate his participation constraint). 

Let us now study the effect of this strategic tuning upon market strategies. Each 
firm owner signs a contract with her manager; later on, managers learn their marginal 
cost and the market demand D{p) = a- bp. The analysis of Cournot competition for 
asymmetric costs can be directly used to solve for the competition among managers. 
Indeed, the choice of the coefficient at by owner i for the compensation scheme of her 
manager amounts to change the true marginal cost c,- into a virtual one aiCt upon which 
the manager of firm i will base his behavior (he maximizes Rt - a/Q). 

Manager z's best reply function is equation (5.4): BRfiqj) = \{a-bct - qj); it is drawn 
as a downward sloping plain line on Figure 6.2. Observe that the lesser the cost a, 
the greater the response. Now, when the owner of firm i introduces a coefficient a* < 1 



in his manager's contract, the whole best reply curve moves up to BRt (dashed line) 
and the new equilibrium (circle) features a larger quantity for the strategic owner and 
a smaller one for her competitor. Since profits are increasing with the quantity sold, 
the strategic owner obtains a greater equilibrium payoff. The intuition of the result is 
simple: by linking wage to revenue, the owner turns his manager into an aggressive 
player behaving as if his marginal cost was lower. 




qi qi 

Figure 6.2: Strategic Effect of Managerial Compensation 

Being equally rational, owner j will also offer a distorted compensation scheme to 
her own manager in order to make him more aggressive. To compute the equilibrium of 
the Cournot game among "hypnotized" managers, we assume that firms share the same 
technology i.e., have the same true marginal cost c; in equilibrium quantities are given 
by equation (5.10): qt - \[a- bc{2ai - ccj)), but profits use the true cost hence 

iii = {p-c)q t = - — c b q ' q ' q t = ^ [a-bc{3 - a t - a,-)) [a- bc{2a t - aj)) 



for i = 1,2 and j ^ i. 

The objective of owners is to maximize profits minus the opportunity cost of their 
respective managers which is a constant; hence owner i maximizes s,- as a function of 
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ai and her optimal choice is easily computed as a, \ = ■ 

formula for owner j , we deduce the symmetric equilibrium a, = a 7 

sales are qt = qj = ha- be) just as in the forward market equilibrium 31 ® so that the same 

efficiency conclusions are valid. 

The same result obtains if the wage's bonus is based on sales q (physical quantity). 



As can be seen on Figure 6.2, the equilibrium involving the strategic use of managerial 
incentives catches both owners into a more aggressive competition. The new equilib- 
rium, at the intersection of the dashed lines, involves greater quantities, thus lower 
price and also a greater efficiency. 

Notice finally that there is no point to design distorted compensation schemes for the 
manager of a monopoly or of a competitive firm. Indeed, these are the cases where the 
competitors fail to react to the firm's behavior either because there are no competitors 
or because they are ignored by price-takers. Hence, owners will look forward to align 
managers incentives with their own, the maximization of profit. To conclude, the is- 
sue of distorting manager's incentive is akin to the strategic behavior characteristic of 
oligopolistic market structures. 

6.4.2 Vertical Integration 

We have just seen that it is in the interest of a firm owner to maintain a certain degree of 
misalignment of incentives with her manager. The possible vertical integration between 
an upstream and a downstream firm entails the same tradeoffs: if firms integrate, their 
interests will be perfectly aligned while if they remain independent, the upstream firm 
will be able to influence the behavior of the downstream firm through the wholesale 
price. 

In the case of the automobile industry, it is customary to see manufacturers enter 
exclusivity agreements with distributors whereby the dealer agrees to sell only the brand 
of the manufacturer in return for a local monopoly. The distributors of the various brands 
in turn compete for the final consumers. For such markets, Bonanno and Vickers (1988) 
show that it is in the interest of manufacturers to deal with independent dealers rather 
than integrate with them (absent any other benefit or cost of integration). 

To check this claim, consider a manufacturing duopoly whose downstream retailers 
compete in prices; it has been shown in §5.2.3 that these strategic variables are com- 
plements i.e, if pi rises (for whatever reason) then dealer 2 will optimally respond by 
raising his own price. Imagine now that manufacturer 1 raises her wholesale price w\\ 
this will raise the marginal cost of dealer 1 who will have to raise his own retail price p\ 
to keep maximizing profits (as shown in eq. (5.19)); hence dealer 2 will raise his price in 
response. 

If the upstream manufacturer can aspire her dealer's profits %\ = {p\ -C\- u>\)Di using 
a subscription fee F\, then she earns the payoff U\ - F\ + w\Di = m + w\D\ = {p\ - c\)D\ 
which happens to be the profit of an integrated firm. We claim that it is a good idea to 
raise w\ above zero because the direct effect is nil (envelope theorem) while the strategic 



(indirect) effect has just been shown to be positive. Formally, the retailer maximizes it\, 
thus |i = implying that §| = w^ < (= if w x = 0). Since §| = ( Pl - a - wi) §^ > 0, we 
obtain 4|£ = fgi|^+<giigi>o given that both retail prices increase with w x . 
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Raising the wholesale price increases retail prices but also profits as the two competing 

goods are strategic complements. 32 ® It is obvious that the competing manufacturer will 
use the same device raising further the consumers prices and the industry profits. The 
limit of this process occurs when the marginal loss due to forgone sales (direct effect of 
j^-) equates the marginal benefit due to increased retail prices. 

If, on the other hand, the downstream competition is in quantities, the clever strat- 
egy for the manufacturer is to subsidize her retailer by lowering the wholesale price 
to the maximum. 33@ Indeed, quantities being strategic substitutes, retailers earn more 
whenever their marginal cost is lower. This time, the subscription fee will be very large 
since it sums the surplus earned by the retailer to the total subsidization amount. As 
we already saw with forward markets and managerial compensation, the subsidization 
strategy is individually rational for a manufacturer but catastrophic at the industry level 
since lower retail marginal costs mean higher sales and therefore lower industry profits. 
This occurrence of the prisoner's dilemma is entirely due to the fact that quantities are 
strategic substitutes. 

6.4.3 Strategic Debt Commitment 

In another example of strategic commitment, Brander and Lewis (1986) blend the finan- 
cial strategy and the product market strategy of a firm to offer a novel explanation for 
the use of debt (in favor of equity) in the capital structure. 

Intuition 

The argument draws on the "asset substitution" effect of debt explained in §23.2.1: debt 
finance together with limited liability turns a firm manager into a risk lover who tends to 
overestimate (resp. underestimate) the positive (resp. negative) consequences of strong 
(resp. low) consumer demand. As a consequence, the manager behaves more aggres- 
sively in the market thereby forcing his opponents to retreat in a manner similar to 
the case of manager's incentives. In short then, debt makes a firm look tough to her 
opponents and we might say that issuing debt ex-ante embodies a beneficial strategic 
commitment similar to a Stackelberg first mover advantage. 

Nevertheless, the use of debt is no costless because of moral hazard. Indeed, when 
the firm goes bankrupt (in a state of very low demand), the debt-holders fail to be repaid 
entirely; anticipating this fate, they require a higher interest rate for the funds they lend. 



It is therefore doubtful that firms will emit debt to take advantage of the aforementioned 
strategic effect. To lift this suspicion, we note that when the firm is unlevered (zero debt), 
taking on a little debt has no financial effect because bankruptcy is not a concern (it is 
highly improbable). However, any level of debt changes the firm's market behavior so 
that the strategic effect is always significant. Hence, it pays to strategically increase 
debt above zero even though the optimal leverage might be quite limited. 

As often with strategic commitments, the prisoner's dilemma looms large because 
if a strategic move can bestow an advantage, it tends to be adopted by all contenders. 
Hence, all rival firms increase their leverage and become more aggressive competitors. 
In equilibrium, the price falls, the overall efficiency of the market increases 34 ® and prof- 
its shrink. 

Model f 

We now formally demonstrate these assertions. The profit of one firm is n{x,y,a) where 
x is her own decision, y her competitor's and a > is a macro-economic indicator whose 
law is given by the cumulative distribution function H. Without loss of generality we 
assume ff > i.e., a higher a indicates a better economic environment. The strategic 
market variable is chosen so that & < i.e., it can be quantity, advertising or the negative 
of price. The crucial element of the analysis is whether j^ is positive or negative. 
Positiveness could arise from cheaper inputs that reduce the marginal cost or greater 
willingness to pay of consumers or positive demand shock (check in the Cournot formula 
§5.4 the effect of parameters over the best reply). Negativeness might occur if firms 
compete in advertising since there is not much point to advertise when everybody wants 
to buy the good (the marginal effect of advertising is dampened by a positive demand 
shock). We perform the analysis for the most probable case of j^ > assuming for ease 
of presentation that x and y are quantities. 

After eliciting a debt level D and collecting the proceeds B of her sale to investors, 
shareholders are left caring for the expected value of profits net of debt repayment, de- 
noted S. Whenever n < D is realized, there is bankruptcy because the debt obligation 
cannot be met entirely. In that case, the firm is closed and the shareholders walk out 
safely because they are protected by limited liability 35 ® Defining 9 as the unique solu- 
tion to n{x,y,6) = D, we have S = f^° [n{x,y,a) -D] dH{a) which satisfies 
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Since D does not appear inside the integral and, by construction, || > 0, we have 
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In the market competition, shareholders choose the strategic variable x to be a best 
reply against the choice y, they expect from their competitor i.e., x solves gf = 0. As 
made clear by (6.20), an increase of debt has the unambiguous effect of increasing the 
marginal value ||, thus the entire best reply function shifts up with debt: |§ > 0. 36@ The 
new equilibrium can be analyzed with the help of Fig. 6.2 pertaining to the study of 
manager's incentives. Whereas the unleveraged equilibrium is symmetric, the new one 
(shown by the circle) features a larger quantity for the strategic player (|g > 0) and a 
smaller one for his competitor (^ < 0). Hence, when a firm increases her debt, she earns 
more in the market. 

Whether any debt is emitted at all depends on the balance between its pros and 
cons. As we abstract from bankruptcy costs and tax advantages of debt, the cash flow 
is always distributed in its entirety to either shareholders or debtholders. 37@ Hence the 
total payoff to firm owners B + S is simply the expected profit V = J^°n{x,y,a)dH{a). The 
effect of a debt increase is indirect and goes through the change in quantities, it is 
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The first term in (6.21) is zero by the envelope theorem since the firm elicits x to 
maximize profits (cf (6.19)). Given that this first term is an average, || must be strictly 
negative at a = 9; now as || is increasing in a, the second term is negative and represents 
the loss in debt value. Lastly, the third term is the positive strategic effect upon the 
competitor of strengthening one's behavior in the product market (recall that ^ < and 
ji < 0). It remains to observe that the second term is zero when debt is initially nil 

(6 = in that case), thus |g > which means that, in equilibrium, both firms emit 
strategic debt. As can be checked on Figure 6.2, the final Nash quantities are greater, 
thus the strategic use of debt is also welfare enhancing since it increases the intensity of 
competition in the consumer market. 

The analysis so far has been conducted with quantity as the strategic variable. Since 
prices and quantities tend to yield opposite conclusions, it is worthwhile to take a closer 
look at the case of prices. If we keep interpreting a as good news for marginal profit, 
we have j-^ > o j^ < following our convention x = -p. We then obtain |g < o 

oYj > i.e., higher leverage motivates the firm to raise her price. Since prices p and p 
(the competitor's price) are strategic complements, we have -£- > thus ^ = ■£ -^ > o 



ajj < following our convention y = -p. Using the fact that ff > o |^ < 0, the analysis 
performed over equation (6.21) remains identical for prices: the first term is zero, the 
second one is zero at the unleveraged benchmark while the third one, being the product 
of two positive terms, is positive. We may conclude that the strategic effect of raising 
leverage is beneficial when firms compete in prices. 



Chapter 7 

Economic Rivalry: Contest and 
Conflict 



In this chapter, we leave market-mediated competition to address the more general no- 
tion of rivalry, when economic agents, driven by self interest, interact in a strategic 
manner. This chapter, using the tools of game theory, will provide us with models useful 
for Industrial Organization and beyond, for all microeconomics.^ We distinguish orga- 
nized from spontaneous rivalry under the headings of contest and conflict. 

The plan of the chapter is not linear. After an introduction linking rivalry to more 
traditional economic concepts, we study purely wasteful conflict under a guise of as- 
sumptions to test predictions. Then, we move to the arbitrage between production and 
appropriation, the so-called productive conflicts. In the third section, we formalize a 
number of stylized model of strategic interaction pertaining to political economy which 
from our 10 point of view are all the relationships taking place outside markets. We also 
analyze various refinement to understand the impact of risk, group play or information 
asymmetries. Lastly, we study patent races and long lasting conflicts known as attrition. 

7.1 Introduction 

7.1.1 Typology 

Economic rivalry is best introduced with a few quotes: 

Edge worth (1881) The first principle of economics is that every agent is actuated only 
by self-interest. The workings of this principle may be viewed under two aspects, 
according as the agent acts without or with, the consent of others affected by his 
actions. In wide senses, the first species of action may be called war; the second, 
contract. 



Pareto (1906) The efforts of men are utilized in two different ways, they are directed to 

the production or transformation of economic goods, or else to the appropriation of 

goods produced by others. 
Lerner (1972) An economic transaction is a solved political problem. Economics has 

gained the title of queen of the social sciences by choosing solved political problems 

as its domain. 
Williamson (1985) Homo economicus engages in a full set of ex ante and ex post efforts 

to lie, cheat, steal, mislead, disguise, obfuscate, feign, distort, and confuse. 

Edgeworth (1881) stresses the dichotomy between violent and peaceful pursuit of 
happiness whereas Pareto (1906) assumes a modern legal framework where violence 
has been eradicated. He then emphasizes the dichotomy between productive and unpro- 
ductive enrichment. Following the latter, we disregard coercion and violence so that we 
fully adhere to individual rationality in the sense that economic agents freely enter into 
contracts over items that have recognizable property rights (cf §8.1 for a justification). 
Lerner (1972)'s quote now makes full sense: there is a complex economic life beyond the 
overtly simple markets and its idiosyncrasies must be understood. Lastly, Williamson 
(1985) reminds us that there are many legal or at least non violent means to pursue one's 
objective. These are a potential source of inefficiencies that contracting can tame, if used 
properly. 

Our typology of lawful appropriation for spontaneous and organized rivalry is conflict 
and contest, examples of which are shown in Tables 7.1 and 7.2. 2@ 

This typology is far from perfect. For instance, advertising is not a face-to-face contest 
between two firms since consumers are involved. The advertising mechanism builds on 
the fact that no customer is bind forever to his current provider; this means that he can 
switch at will if the promotional effort of a competitor is convincing enough. 

A patent race is not a peaceful conflict revolving on appropriation. Firms seek the 20 
years legal protection bestowed by a patent because of its market value which originates 
in the service it renders to consumers; it is thus to some extent socially useful although 
this benefit must be mitigated by the cost of developing the innovation (cf. §12.2.3). 

Rent-seeking is an implicitly organized contest since its very existence arises from a 
decision taken by the State. To avoid rent-seeking, it is enough to award the license at 
random (using a lottery) but by doing so, the State bureaucrat foregoes a useful interac- 
tion with lobbyists. It does not matter whether he seeks a bribe or relevant information 
in order to make an efficient choice. 

Procurement mixes contest and conflict because a contender will make both a pro- 
ductive investment to develop a good project and a power (lobby) investment to gain 
influence over the jury (cf. §14.4.1). 



Advertising and marketing as non price market competition (cf. §11.5). 
Rent-seeking: vying for State awarded economic rents (either by a bureaucrat or a politi- 
cian). Applies in international trade policy, industry regulation, price support schemes, 

privatization or subsidies (cf. §16.3). 
Innovation and Patent races see firms invest into R&D to develop drugs, technical devices or 

ideas (software) in order to either clinch a leadership or receive a legal monopoly for 20 

years (cf. ch.12). 
Litigation in court over ill defined property rights (cf. §14.3), especially intellectual ones 

(cf. §12.3). This wasteful activity is a key ingredient of transaction cost economics (cf. 

§13.3.3). 
Influence over tax rates or wages indirectly through the media or directly through strikes to 

pressure a decision maker, be it a court, a government or a firm. 
Political competition for elected offices by swaying electors (public choice theory). 
Attrition: battle to control a new technology or standard (cf. Bertrand Paradox in §7.4.2). 
Externality: congestion (cf. §25) or depletion of a common resource that force other to bear 

extra cost (indirect conflict). 



Table 7.1: Examples of Economic Conflicts 

Apart from most auctions where payment of bids is contingent on winning the item, 
contests and conflicts tend to display an "all-pay" structure whereby everybody, winners 
and losers alike, forfeit the resources expanded to win the prize. This feature tends to 
make the aggregate effort or investment large, possibly greater than the prize. In pro- 
ductive contests, this is a desirable property while in the case of unproductive conflicts, 
this is a source of inefficiency. 

7.1.2 Origins of Economic Rivalry 

A conflict arises when a valuable item has weak property rights, so that it becomes 
cheaper for some agents to engage in a struggle over possession rather than buy it. 
The means so employed can be legal (e.g., litigation, influence, pressure), can border on 
illegality (e.g., white collar crime, creative accounting), involve deception (e.g., fraud, 
theft, debt defaulting, lying about relevant characteristics) or, at the limit, use violence 
(coercion, extortion, outright aggression and war). Since the legal system is not able 
to perfectly protect us from these risks, we are forced to spend resources at protecting 
ourselves; as shall be shown later on, this is a major source of inefficiency and waste. 

Contests are more specific to the economy since they may be viewed as optimal re- 
sponses to market failures. In the oligopoly models of this book as well as in the perfect 
competition paradigm, the interaction of firms is mediated by the market through prices. 
There are however many reasons why the (anonymous) market may fail to perform the 



Auctions: the seller puts an item for sale with the hope of attracting many bidders and 
earning the largest possible income (cf. ch.22). 

Beauty Contests: an implicit auction where contenders expand effort in a more or less licit 
manner to impress the jury and win the prize e.g., Olympic games locations, license 
awarding (cf. §22.1.2). 

Procurement: Firms or public bodies design a set of requirements and let contenders strive 
to achieve them at the lowest cost (cf. §14.4.1). 

Tournaments: Firms and organizations use relative performance tools to decide on promo- 
tions or design reward schemes with the varied aims of ranking (cf. screening in 21.2) 
or motivating employees (cf. incentive theory in §20). 

Regulation of public services providers: tournaments are used to gather discriminating in- 
formation regarding firms and be able to reward and punish them (cf. yardstick compe- 
tition §17.3.2). 

Education: a filter to assign people to jobs or schools (cf. screening in §21.2). 

Sport: competitions where the objective function of participants goes beyond the maximiza- 
tion of monetary rewards to include fame and other forms of social recognition. 

Scientific Prizes: intended to foster effort, imagination and ultimately advance human 
knowledge (cf. §12 and Bays and Jansen (2009)). 



Table 7.2: Examples of Contests 

desired exchange. To name a few, property rights are difficult to define or enforce, the 
product specifications are not standard, the traded item is indivisible or there is asym- 
metric information regarding the value and cost of the item. These transaction costs 
may lead the parties to engage in direct contracting (which is the object of Part F on in- 
tegration). An implicit assumption of face-to-face relationships is that the parties know 
they can achieve a mutually beneficial trade together. In many situations, however, each 
prospective trader has to search for the ideal partner; it thus makes sense to organize 
a contest; the rules and prizes will be designed to elicit the private information of con- 
tenders (adverse selection) and/or motivate them toward maximum effort (incentives). 
An organized contest is thus strategically similar to a conflict but it is productive as it 
serves the promoter's objective, which is often aligned with society's own values. 3 ® 

7.1.3 On Rivalry and Efficiency 

According to von Weizsacker (1980), economic competition can take place at the levels of 
consumption, production and innovation to which we respectively associate the concepts 
of allocative, productive, and dynamic efficiency. 

Direct competition (aka rivalry) in consumption is generally prohibited by property 
rights (e.g., A cannot lawfully compete with B to occupy fi's house) but competition for 



the exchange of the property rights relating to consumption is lawful (e.g., A can com- 
pete with others to buy B's house). Restricting competition at the level of consumption 
(by upholding property rights) gives incentives for productive activities and hence for en- 
hancing the means of consumption. Analogously, intellectual property rights (cf §12.3) 
prohibit some forms of competition in production in order to stimulate competition in in- 
novation, which in turn expands the means of production. We cannot then talk of more 
or less competition, but of more competition at one level and less at another. The limits 
set by law upon economic rivalry aim to uphold the Holy Trinity of efficiency: 

O Productive Efficiency parties invest resources into welfare enhancing activities (seller 

minimize production cost, buyer maximize value of use). 
© Allocative Efficiency parties create value through exchange (from low WTP towards 

high WTP). 

© Dynamic Efficiency resources are efficiently invested into economic sectors using 
the return on capital criteria. 

According to Usher (1987), of all forms of rivalry, theft epitomizes an antithesis of the 
Holy Trinity, we choose to call the Wicked Trinity: 

® Parties waste resources into offense and defense, not value creation. 4 ® 

© Value is reduced either by the destruction of items or their inadequate ownership; at 

best there is a wealth transfer. 
© Economic sectors prone to theft are under-developed as they yield poor returns, thus 

creating a deadweight loss. 

Legal forms of rivalry partake some but not all of these wicked ways. A negative 
externality or a free rider problem features only ©, inadequate investment because the 
absence of property rights prevents the correct allocation of cost and benefits (cf. §2.4.4). 

Over-exploitation of a common pool resource (CPR) features ® and ©, the latter be- 
cause excessive extraction today reduces the available stock for tomorrow and the former 
because, in anticipation of this fate, all users enter a race to the bottom with excessive 
investment. 

Advertising and other profit seeking strategies feature ® and generate © as products 
become dearer and sell less. The opposite of © may nevertheless appear if advertising is 
informative or when the profit seeking strategy has a productive component. 

Monopoly statutes and publicly awarded rents motivate ® (investment into acqui- 
sition or to bar a challenger). Monopolistic pricing then generates © directly as the 
standard dead weight loss but also indirectly because the extraordinary returns allowed 
by monopolization foster too much investment in the sector. 



Tax incidence in public finance studies how the tax base reacts to the tax code and 
is thus about ® (inefficient investments in tax avoidance) and ® (distortion of economic 
activity). 

7.2 Wasteful Conflict 

Our focus is on conflicts such as rent-seeking (cf §16.3), advertising, lobbying or influ- 
ence; note however that beauty contests share strategic similarities. We content our- 
selves to develop some game theoretical models addressing the issue of rent dissipation 
in a partial equilibrium framework i.e., how much of the prize is wasted by the efforts 
to win it. After presenting the symmetric case, we look at a variety of asymmetries that 
dramatically reduce the wealth dissipation. 

In contrast to productive activities in which inputs are combined cooperatively in the 
production technology, the inputs to appropriation are combined adversarially in the 
conflict technology. Game theoretic models of contest and conflict either use a perfectly 
or imperfectly discriminating contest technology i.e., either the strongest side wins for 
sure or she simply has an edge over the rest of the flock. In this section, we adopt the 
latter setting. Auctions, by their very definition, adhere to the former (cf. §22). 

This section encompasses four parts. In the first, we consider the simplest and most 
famous model of contest with exogenous prize and identical contestants. Then, we ex- 
pand this basic staple by allowing many forms of asymmetry among participants. Next, 
we take a different path by endogenizing the prize to show that the contest paradigm 
covers many models of strategic interaction. Lastly, we show how some conflicting situ- 
ations can be brought toward efficiency using taxation. 

7.2.1 Wealth Dissipation 

Many partial equilibrium economic models of rivalry, whether spontaneous or organized, 
rely on strategic contests whereby a prize (exogenous or endogenous) is alloted among 
contenders. In the sharing interpretation, each receives a percentage of the prize value 
whereas in the lottery interpretation, a unique agent, the winner, receives the entire 
prize. Under risk neutrality, both visions are equivalent in terms of induced behavior. 
Like most of the literature, we follow Tullock (1980)'s lottery approach for ease of expo- 
sition. 5@ 

The basic payoff structure involves n participants (denoted i,j), each choosing an 
investment kt (in either financial or human capital). We denote Vt the WTP of player 
i for the prize and by pi his probability of winning the contest (alternatively, his share 



of the prize). Assuming risk neutrality, a party cares only for her expected profit which 
is Tit = ptVi - k t . Most game theoretic models of imperfectly discriminating contests or 
conflicts use the separable contest success function (CSF) with 6@ 

The influence technology ht{.) transforms a monetary investment k t into an amount 
of leverage e ; = ht(ki) that can be meaningfully compared to the challengers' choices. 
As customary in economics for activities performed by humans, we assume decreasing 
returns to scale (h" < < h' t ) so that pt is also increasing concave in own investment. 7 ® 
The payoff of contender i in the contest is 

ndkt) e t 
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where Cf[.) = j-h~ t l {.) is convex increasing. The interplay between investment and leverage 
shows that the canonical model of rent-seeking is one where participants seek a unitary 
prize allocated by a pure lottery but with differentiated cost of participation. 8@ It often 
makes sense to assume a common influence technology with the power function while 
allowing for private valuations i.e., c;(e) = ^- with a > 1. Observe then that a higher 
prize valuation is akin to a greater influence ability. 

By convention, for any list U/)/<n, we define xq = TjXj as the aggregate contribution 
and X-t = I.jjnXj, the aggregate influence of z"s challengers, the FOC of optimal individual 
behavior is easily computed as 



OTTj _ e-j ^ ^,_| ,,2 ,,o-l 

det e 



0= _ 11 = ^ v ,_ e a-l ^ e 2 e a-l = e _. v . (73) 



In many rent-seeking situations, it makes sense to assume a constant and common 
valuation v of the prize because it is determined by the State and has a well known 
market value which contestants cannot alter. In that case, the equilibrium is symmetric 
with ei = e so that (7.3) becomes n 2 e a+1 = [n- l)ve. The individual investment is then 
c{e) = ]-e a = nz \v while the aggregate cost, nc{e), is a proportion 6 = ^=f of the prize 
v; it is called the index of rent dissipation. The equilibrium expected individual profit 
is n* = -(l-— ) > since a > 1. This shows that each contender has an incentive to 
participate, even after factoring the cost of influence. A larger prize v triggers more 
influence activity but as the rent dissipation index is lesser than unity, participants still 
gain. 

The original literature on rent-seeking and regulatory capture assumed a = 1 and n 



large so that the rent dissipation was to be almost complete. As we show hereafter, many 
realistic features contribute to reduce this dissipation. Furthermore, the lottery models 
are ad-hoc and fail to provide a micro-foundation of the interplay between contenders 
and the contest organizer. 

7.2.2 Asymmetry 

The basic model of rent-seeking is extended to allow for heterogenous contenders. The 
general conclusion will be a softening of the previous striking conclusion regarding rent 
dissipation. An example of asymmetric situation is the regulation of a monopoly as it 
involves firms and consumers who differ in their valuation of the rent to be or not to be 
awarded. Indeed, as we saw in 3.2, the consumer surplus rises more when passing from 
monopoly to efficiency than the producer surplus upon moving in the opposite direction. 
It is thus important to assess the impact of asymmetric valuations. 

Asymmetric Valuations 

Hillman and Riley (1989) focus on the pure lottery contest technology (a = 1). The FOC 
(7.3) now reads ejj = (e -e ; -)^- => e, = e (l-e /f;) i- e -> the greater the WTP, the greater 
the influence expanded. Summing over the participants, we get e = ne - e\li) n where 
the harmonic mean is defined by J- = -2;-^-. We deduce the total effort §o = ^Vn i.e., 

V u Yl J V j fl 

dissipation is reduced (w.r.t symmetry) since v n < v n = -ZjVj is always true. 9@ For later 
use, we derive the individual investments of the bilateral case. When n = 2, v = — l —f and 

V( -t- V j 

e = f , thus individual investments are 

2 
V jV 

(7.4) 
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which satisfies the intuition according to which the more motivated participant (greater 
valuation) invests more in order to produce a greater influence on the outcome. 

Lastly, we must check how many contenders are active since the asymmetry of values 
generates an asymmetry in profits. We have ^^ = ^- - 1. For e n+ \ - 0, e = Z,<„ +1 e; = 
?-i<nej = ^Vn at the equilibrium of the contest with n active participants. The ulti- 
mate entrant is thus n such that nv n+ \ < (n - 1) v n . As a matter of example, we consider 
geometric ranking with v\ - v j l for < j < 1. The solution j n to the entry equation 
y(l -y") = [n- 1)(1 -y) means that for y <y n at most n contestants participate. Table 
7.3 displays numerical results and the corresponding dissipation (computed at the max- 
imum WTP). 
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Table 7.3: Contest Performance 



Asymmetric Burden 

A typical case of asymmetric valuations occurs with lobbying over a policy measure (e.g., 
subsidy or tax avoidance) whose potential recipient is specifically group #1. If the govern- 
ment is restricted to budget neutrality the cost v of the policy if implemeted, is spread 
equally over the remaining groups; hence V\-v whereas v t - -^r for i > 1 (payment 
avoidance). Applying (7.3) to groups #1 and #i for a = 1, we have e~\V\ = e\ = e-tVt. Using 
e-t = ei + e_i - et and the fact that e; = -^V in equilibrium, we get e_i^- = e\ + e_i£=l, thus 

gi _ " ~ 3 J! +3 e-i. Plugging back into e_i y = e„, we finally obtain e_i = - w ~2 1 v so that the 
wealth dissipation is 6 = ^"^.f" 1 = , n ~\ - - -K. Lastly, the probability that the measure 
passes the voting hurdle is p\ = - e + \ = 1 - j6. This result rationalizes the observation 
by Pareto (1906) that "A policy measure benefitting one group but paid by all is hardly 
opposed". 



Commitment 

Dixit (1987) studies Stackelberg leadership in rent-seeking games and shows that among 
identical participants, everyone would like to become the leader, a result reminiscent of 
quantity competition. 

Consider then a symmetric duopoly rent-seeking game and assume that one party, 
say #1, can pre-commit her influence effort k\. She will overinvest if the indirect effect of 
her influence is positive i.e., when ^-§p > 0. 10@ By construction, pi + p 2 = l and since an 
investment is always useful it must be true that -^- < 0, hence ^ = -^- < which means 

that overinvestment requires strategic substitutability ^<0o0> g d k g| oc gk p d \ since 
the impact of A;i on 7r 2 runs only through the winning probability p 2 . 

To be able to conclude, we must consider explicitly the separable CSF (7.1) to derive 
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P2 _ h 2 hiih2 hi) x fo u Hence, the leader is motivated to overinvest 

1 >■-<"'•■■ {hi + h 2 r 



when she is the most influential participant. In conclusion, the strongest contestant 
would like to pre-commit in order to extend further her leadership. Observe that this 
conclusion carries to the oligopoly context. 

We can recover this result analytically in the duopoly with pure lottery. The Cournot 
equilibrium is given by (7.4) but as we need the best reply, we have to solve the FOC 

Vikj - kj. The Stackelberg leader maximizes ftt = -^zy/kt - kt hence 



V[k 
{ki+kj) 



'-^ = 1 i.e., ki = 



V 

solves — -M — = 1 => fcf = -j-f- and the follower chooses k F = -p- [2v ,- - y,-l. We check that 

vt > vi o kf> k F . Observing that total investments are k c = — '—j- and A; s = \vu we have 
k s > k c o vt > Vj so that the ability to pre-commit is socially wasteful if and only if it is 
handed over to the most interested player. 

Litigation 

In many conflicting situations brought to justice for settlement, an element of fault or 
misconduct impinges on the outcome. We may cite divorce, patent infringement, wrong- 
ful contract termination, insufficient performance or failure to deliver (a good, a service). 
Ideally, tort or innocence should be attributed with certainty to whom it belongs. Like- 
wise, the outcome should only depend on litigation efforts if there is an equal degree 
of fault and should only depend upon the degree of fault if there is an equal litigation 
effort. Modeling these features after Hirshleifer and Osborne (2001), we show that liti- 
gation can be seen through the lens of the canonical conflict. 

Letting jc,- £ [0; 1] indicate party i's degree of fault and kt her litigation effort, a sensible 
way to meet these criteria is for the ratio of winning probabilities to be y = I j-) I f • 
The positive parameter (5 and y make each ratio more or less influential. For instance, 
the codified legal systems (e.g., French and Germanic) leave little leeway to attorneys so 
that p » y whereas in common law systems (e.g., US and UK) the reverse would hold. 
Since pj = 1 - pi, the influence function is hi(k{) oc kJxT p => aiet) oc (e^xp 1 ' 7 so that we are 
back to the canonical model with individual valuation v i oc x which tends to infinity 
when the agent becomes the deserving owner of the item (xt -* 0). Using (7.4) for y- 1, 
we deduce that the more deserving agent (smaller fault factor) invests more i.e., being 
right motivates to fight harder. 

If the law contemplates this option, conflicting parties can settle their difference out- 
of-court with a side payment (withdrawing their lawsuit and counter-lawsuit). This way, 
they avoid the costly legal expenses of an actual trial, the negative media exposure and 
the inefficiencies associated with a rigid court judgement. However, parties often dis- 
agree wrt. their odds of winning the trial making their demands incompatible. As they 
fail to reach an agreement, trial becomes inevitable. In §7.3.5, we model this failure of 
the Coase theorem. 11@ 

Incumbent Advantage 

Fisher (1985) argues that most rent-seeking contests have an incumbent whose renewal 
is almost assured. As we show hereafter, asymmetrically empowered rent-seekers jointly 
spend less in wasteful rent-seeking activities than if they are symmetrically empowered. 



The result also applies to rent-seekers within an organization, public (e.g., agency) or 

private (e.g., firm) where one stands at a higher rank on the hierarchy ladder. 

Assume that contender #1 is an incumbent whose rent-seeking investment is A > 1 

times more productive than other people's money. The asymmetric winning probabilities 

are p\ = Afc A ^ 1 fc and pt = Ae ^ e for i > 1. For the case of linear cost a- 1, expected profits 

are 

ii\ = A v and %[ = v - kt for i > 1 

\Xki + k-i A; Akt + k-i 

The FOCs are then 

Xvk-i = A; 2 = vk-t for i > 1 (7.5) 

In a semi-symmetric equilibrium, we have kt = k for i > 1. The FOC system (7.5) now 
reads \{n-l)k= (n-2)k + k± => & = (n-l)A-n + 2. Then, the left FOC yields Xv(n-l)k = 



) 2 = fc 2 ((n-l)A + l) 2 ^'- : "'- lllr 
total cost over the prize value i.e 



(A;i + (n- l)kf = k 2 ((n - 1)A + if => k* = (( ^" 1) {^ 2 - Lastly, the degree of rent dissipation is 
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so that j6 - 2 %i when n is large or fi = | when A is large. 

In a typical case with three challengers (n = 4) and where the incumbent holds as 
much effectiveness as they do altogether (A = 3), then rent dissipation is 50% instead of 
75% if players were equally influential. 

7.2.3 Canonical Rivalry 

Up to now, the prize was exogenously given; this suits well the idea of a contest where 
the organizer determines the prize characteristics ex-ante. Many forms of rivalry (all 
conflicts and some contests) however involve an endogenous prize determined by the 
conjoint actions of contenders. In the following generalization, the participants' inputs 
will simultaneously determine the size and distribution of the prize. 

This is the case for instance when economic agents work in team and pool their efforts 
to produce a result which is then divided in proportion of individual contributions. The 
exploitation of a common pool resource (cf §18.2.3) presents a similar situation since 
aggregate effort determine total catch or extraction while nature makes sure that, on 
average, each receives a share of output in proportion to his input. Still another appli- 
cation of this general setting is the Cournot model for an homogeneous good given that 
the price determination pools the efforts of agents, here the individual outputs of firms 
(cf §5.1). 



The canonical model of rivalry takes the canonical model of rent-seeking (7.2) and 
assumes that investments k = [ki)i<„ are perfect substitutes whose aggregate k = Y.i< n kt 
determines the prize V{k ). We assume that investment has always a marginal value 
i.e., V m > but that the unitary value v(ko) = ^j^ is decreasing (a property close to DRS). 
Given proportional sharing of the prize i.e., pi = f-, individual payoff is 

Ttt = -±V{k ) - kt = v{k Q )ki - kt (7.6) 

k 

In the Cournot model with common marginal cost c, V{q ) = — c q is the normalized 
market revenue generated by the aggregate output of firms (cf §5.1). In the Commons 
model, investments are pooled to extract valuable output from the community owned 
stock (cf. §18.1.4). 

The collective benefit is n {k ) = T.i<n n i = V(k ) ~ ko- Observe that in this particularly 
simple model, each participant gets his fair share since we have m = ptn (by construc- 
tion). The aggregate profit is maximum for k^ solving V m = 1, yielding n* Q . In the Nash 
equilibrium, an individual firm solves the FOC 

l = kiV m + v = k t — + v = piV m + (1 - pt)v (7.7) 

/Co 

i.e., marginal cost equal to a weighted average of marginal and average value. 

We immediately observe that the equilibrium must be symmetric because (7.7) yields 
Pi - ^r^-(ko) which is independent of the identity. It must therefore be true that pi = \ 

so that (7.7) becomes 

1 n-lV(Jfco) , v 

l = -V m (k Q ) + r^- (7.8) 

n n ko 

i.e., the marginal cost (LHS) is an average between marginal revenue and average rev- 
enue with weights determined solely by the market structure. Equation (7.8) thus ranges 
from the monopolistic case for n = 1 to the competitive limit with free entry as n grow 
large. Observe finally that the monopoly implements the collective optimum i.e., the inef- 
ficiency brought about by non-cooperative interaction can only be eliminated by forming 
a cartel. Alternatively, the collectivity may use a tax instrument to achieve the same 
efficient outcome as shown in §7.2.4. 

Exogenous Shock 1 ^ 

Kotchen and Salant (2009) inquire the impact of a cost increase (or a value decrease) 
upon aggregates in a contest where technology is common. Removing the cost nor- 
malization used above, individual profit becomes Tit = {v{ko)-c)ki. The FOC for profit 



maximization remains (7.7) but with c on the LHS. Since it is symmetric, we have 
nc = B(k ) = nv{k ) + k v'[k Q ). Symmetry is also used to expressed the SOC in a leaner 
way: 

0>2v +kiv oc 2nv {ko) + koV {ko) oj<2n 



where 7 = —, ( , fco) is the elasticity of the slope of v. We get a unique equilibrium if the 

-V (Ko) 

RHS of the FOC is decreasing i.e., if 

> nB = (n + 1) v (fc ) + k v (fc ) o 7 < n + 1 

Under this condition, -^ < and since k{ = ^, we also have -^ < 0. To tackle the effect 
of an exogenous cost shock, we compute 

dm dmdki dnidkj ' rl *, dk j , 

= + (n-l) i -l = + (n-l)y (H) — --1 

dc dkt dc dkj dc dc 

-, . dkj dkj rthn 11 i . 

and since -r-f = -^-rr = ^-V, we obtain 

dc fco dc w a ' 

— L = j - 1 > o {n - l)v' {kZ) > nB {kn) o y > 2 o 2v' (Ar ) + A; v" {k ) > 

dc n p 

which is equivalent to V being convex (recall that V{k) = kv{k)). 

Thus, when there are sufficiently many firms (n not too low) and value is convex 
at the equilibrium, we may have the paradoxical result that a cost increase benefits 
all members of the industry (case where 2 < 7 < n + 1 holds true). In the pure Cournot 
model studied by Seade (1985), O(x) = x, thus v{.) = P(.) and 7 > 2 is equivalent to local 
convexity of market revenue qoPiqo) around the equilibrium. In our leading linear de- 
mand example, market revenue is concave thus the paradox does not occur. However, it 
will arise if the demand displays a low constant elasticity e since in that case we have 
7= l + l/c>2oe< 1. 

7.2.4 Enforcing Agreements 

Consider a strategic interaction where the Nash equilibrium is deemed collectively inad- 
equate because society's members are trapped in a prisoner dilemma (cf §2.4.1). Think 
for instance of the over-exploitation of a natural resource under open-access wrt. private 
property management (cf. §18.1.4) or the oligopolistic equilibrium of an industry wrt. 
the monopolization by a cartel (cf. §9.1). 

We shall prove that when firms or agents are only mildly asymmetrical, the ideal out- 



come can be reached if the collectivity can institute a tax system and redistribute part 
or all of its proceed without incurring too large a transaction cost. Setting adequately 
this Pigouvian tax, each member remains free to pick his strategic decision. 12 ® The op- 
posite situation where the non-cooperative equilibrium features an under achievement 
is solved in a symmetrical fashion with a subsidy instead of a tax. 

Theory 

Consider an economic interaction summarized by the game (^y(q)) ;<n where q = {qj)j<n 
indicate strategies. We designate the Nash equilibrium with exponant symbol * while 
* stands for an ideal situation for which we assume aggregate payoff dominance: A = 
^o - ^o > where no = Y,j< n nj and n* Q = Lj<„7rt . 

The unit taxes are t = (Tj)j< n , the revenue collected T = T.j<n T j c lj is redistributed 
according to shares A = (Xj)j<„ satisfying Xj > and Ez<„A ; - < 1 to guarantee that the 
scheme does not run a deficit. The modified game has payoff ndq) = ndq) -tiqi + AjT. 
The FOC characterizing the player's best reply is = ||j- = |^- - CI - Aj)t*. The unit tax 

T; needs to solve the previous FOC for q* i.e., n = jz^ where f3 t = 3 
larger fraction At of tax revenue leads to a larger tax rate t ( . 

If T,i< n Xi = 1 (i.e., there is full refund), then total profit in equilibrium is n Q = n* - 
Hi^nTitfi +T.i< n XiT = tiq-T +T = ttq. If the starting situation is entirely symmetric in 
terms of preferences [jij (•))•<„ , ideal output q* and redistributive shares A, then tax 
rates are also symmetric. In that case, individual profit is just the fair share of total 
profit and it is obvious that all players gain by A/n. By continuity, the result must carry 
to a mix of incomplete refunding and mild asymmetries, the individual rationality (IR) 
condition for participation being 
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or in aggregate T.i<nMlL]?i jz^: - Lj<njS^*-A. If some of these inequalities are violated, 
one needs to compensate involved players in an ad-hoc fashion which is likely to be 
impractical and/or unfeasible. 

If firms are identical, it makes sense to consider a common share A and the (IR) 

condition becomes n < Ji* + ^/5q* o ^ ^ a = ^ = W ^ X ~ ~ X ~ fcs- For any A £ ^' nl' 
firms end up better off. The maximum untax revenue that can used to different purposes 
(double-dividend) is simply the prize for cooperation A (= (l - An) T in this case). 



Cartel 

In the cartel case (cf. §9.1), the ideal is the monopoly outcome and the underlying inter- 
action is Cournot competition for the market demand D{p) with individual cost functions 
Ciiqt). The cartel's optimum level of sales q* = Lj< n g* solves P{q^) + q^P'iq^) = C m {q*) = 
C m ,i(q*) = c*, thus 



''-at 



= P{q*) + q\ P\ql~) - C m<i {q\) = [q\ ~ 4o) P'OfS) = V*-(p' ~ c*) 



If firms are identical then p = ^(p*- c *) = ^-^ so that a = A = -=Ml- ^) < 1. In 
the case where marginal cost is constant, the ratio of industry profits is ^ ' 2 (cf. §5.12), 
thus A = „ 3 "t 1 3 which is quite small. 

Commons 

Contray to what happens in the cartel case where consumers and welfare are hurt by 
the enforcement mechanism, the latter may be used in problems of congestion and over- 
exploitation of commons to bring an inefficient equilibrium towards the efficient outcome. 
As before, the ideal situation is the profit maximizing exploitation of the common that 
would be undertaken if it were under sole ownership whereas the default situation is the 
open access equilibrium. 

Formally, this is like the Cartel since both share the same underlying model of canon- 
ical rivalry (cf. §7.2.3). Hence the lower limit for refunding only depends on the ratio of 
industry profits between cartelization and competition. If we further apply this result 
to the logistic case (cf. §18.2.3), then there is a perfect identity with the Cournot model, 
hence the same lower limit for refunding. 

7.2.5 Productive Conflict 

Investing into a conflict or a contest to improve one's odds of success is, of necessity, a re- 
linquishment of immediate consumption or production, aka. the "guns vs. butter" trade- 
off. 13@ The contested prize then becomes endogenous because it is created by the very 
parties seeking its appropriation. For instance, labor tournaments commonly reward be- 
havior that contributes to increase productivity within the firm, thus the distributable 
profit. Likewise, employees within large scale organization routinely engage into lobby- 
ing because their output and position is insecure as it can be taken away by a challenger 
in a back room deal with the hierarchy. The dilemma is then to allocate optimally one's 
resources between production and appropriation. 



Paradox of Power 

To account for the production vs. appropriation trade-off, we need to approach rivalry 
from a general equilibrium perspective as originally done by Haavelmo (1954). 14 ® To 
ease the approach, Hirshleifer (1988) models Production & Conflict (PC) in a partial 
equilibrium framework where decision makers must allocate their scarce human and/or 
financial capital between creation and capture. 15 ® 

Each economic agent has a maximum welfare contribution q t reflecting differentiated 
productive abilities as well as endowments. Each can invest kt to build influence at the 
rate e ; = ktl A; where a greater A* reflect a lower influence ability (productivity). 16 ® The 
net welfare contribution is thus qt = cji - Xtet. The influence technology is the standard 
one with pt ■ = f 1 . Normalizing maximum GDP to unity (qo = l), the profit of player i is 

e- 1 

iti - Pitfo - — -Xe, with A= — Z/A ; e ; - (7.9) 

This is the rent-seeking setting except that the linear cost has an endogeneous slope. Yet, 

the usual intuition remains correct: an agent better at appropriation invests more into 

rent-seeking. We use the influence e, as strategic variable instead of the investment kt. 

The FOC of optimal power investment is = q -^ + Pi-^r- Since -£j- - ^-p- and -^ - -X t , 

we obtain 

^ = A,-^- (7.10) 

eo 1 - pt 

i.e., the product of individual productivity by the hazard rate of victory is constant over 
all participants. 17 ® Given that the RHS ratio is increasing in p t while the LHS is con- 
stant, a better influence technology (X t < Xj) leads to a greater chance of winning (p t > pj) 
i.e., a larger influence (e ; > e,). Since differential capture productivities impact the re- 
sults, one may wonder if differential creative productivities reinforce or counter these. 
It so happens, in this simple model, that the two productivities play opposite roles i.e., 
being twice better at capture is like being twice worse at creation. 18 ® The two activities 
are thus complementary and we obtain the exploitation paradox: 

I In the production vs. appropriation dilemma, agents more creative or less able at 
capture, earn smaller shares of the overall output. 

This paradox explains the prevalence of rent-seeking when binding distribution agree- 
ments are unenforceable. 19 ® In the case of influence within organizations, the exploita- 
tion paradox implies that those most able at managing power and human relationships 
enjoy better jobs and perks than engineers or technicians. Furthermore, once someone 
has developed an edge into an activity, she is likely to improve further her productivity 



through learning and will therefore completely specialize in it. 

The previous paradox was conditional on a positive influence activity by all agents in 
order for the FOC to apply. Under that proviso, differential endowments do not seem to 
matter. To say more about that we need to solve for the equilibrium. Observe that the 
FOC for (7.10) reads {l-pi)qo = MPi^o = ^t^i = qi-qt so that summing over all contenders, 
we obtain {n - l)q Q = 1 - q (recall q = 1), thus q^ = \ which means that aggregate expen- 
diture on influence, dissipation, is a fraction — of maximum potential output as in the 
lottery model of rent-seeking. From (7.10), we have q* (e - e{) - Xteteo => e* = n *lT and 

summing again, we derive e - I/ - *lT „ => 1 = E/ rn \ /„« which has a unique solution 
6q determining indiviudal degrees of influence. 20 ® With two contenders, (7.10) can be 
worked out to yield the more readily interpretable ^ = JXtlXj. 

We see that at least half of the potential production (value) is lost in wasteful rent- 
seeking with a potential full dissipation limit . 21@ We can now draw some conclusions 
regarding final payoffs in the duopoly case. If parties have the same capture abilities, 
they share equally the value produced, this independently of their initial wealth differ- 
ences (i.e., Xt = Xj and qt ^ cjj => n* = n*). This is so because productive inputs are pooled 
before the contest decides on the division. If one of the agents now improves her capture 
productivity, she ends up investing more into capture, capturing more and earning more. 
However the inequality will remain stable in subsequent periods. The case of a windfall 
income is altogether different. If 5/ accrue to party i, other parties grabs a share 1 - »* 
of it. We thus obtain the paradox of power: 

In the production vs. appropriation dilemma, wealth is equalized inso far as parties 
have the same abilities at creation or capture. Any windfall income for one party is 
necessarily shared with the other in the proportion given by the ratio of creative (or 
capture) productivities, so that income inequality is reduced. 

In the alternative interpretation where A is a creative productivity, endowments are 
inputs for final welfare creation. In equilibrium, endowments are wasted at 50% rate into 
influence. It is quite obvious that if an agent can produce a non appropriable output with 
a productivity greater than i, she will forfeit influence activity and risky (though valu- 
able) production to concentrate on secure (although obsolete) means of profit. This can 
explain why developing countries are autarchic in nature since exports require licenses 
and more risk taking than local production and retail. This argument can be extended 
to innovation: in the presence of a weak protection of property rights (e.g., illegal file 
sharing through peer-to-peer networks), a firm will pass over an innovation if it is too 
easily copied or illegally distributed. 



Settlement 

Meanwhile parties engaged in lengthy negotiations (or conflict) have not settled their 
differences, they must forego alternatives activities, for instance valuable investment 
opportunities. 22 ® More generally, haggling over a distributive issue reduces the amount 
to be distributed. We may thus assume that, either the managers reach an agreement 
(Aj, Xj) for sharing the prize q or they engage into conflict over the smaller prize 9q where 
9 < 1 measures the transaction cost of squabbling. Under this specification, influence 
activity is indeed an investment such as building trust with the CEO that can be used 
later on if no agreement is found with a competing manager. 

Since their disagreement payoff is Oq while the agreement payoff is q, the value of 
cooperation is 6 = (1 -d)q which is shared evenly (cf §2.4.3). To ease calculations, we 
assume equal productivities with X A = A B = 1. The profit is now ni = pt9q + \{l-9)q. The 
FOCs leads to 

^(e P A + la-e)) = q = 1 ^- B (ep B + la-d)) a.ii) 

If e A > e B then p A > p B by the definitions of the probabilities but then this would 
imply e A < e B in (7.11), a contradiction. The solution is thus symmetric with e A = e B and 
Pa = Pb = \- Plugging back into the FOC and using the same method as above, we derive 
e* = 2(b+d which is increasing with 9 whereas final production q* = ^ is decreasing with 
9. As conflict becomes more wealth-destructive (9 \), parties get less incentives to seek 
rent and thus share a greater expected prize. Adopting a warlike language, we may 
summarize this result as: 

I Immediate settlement is thus more attractive, as the weapons at disposition of each 
party become more deadly. 

7.3 Political Economy 

In this section, we apply the basic rent-seeking model to a range of political economy 
issues, among which lobbying, political pressure, collective action or the determination 
of the size of firms. 

7.3.1 Lobbying 

Since legislators, governments and bureaucrats have the power to enact and implement 
regulations or legislations, special interest groups vie to influence them in order to sway 
the outcome towards their interests (cf. §16.3.2). Becker (1983) 23@ analyzes the inef- 
ficiencies generated by monetary transfers between SIGs and the government over the 



choice of a tax-subsidy program which is opposed by tax-payers and supported by recipi- 
ents lobbies. The impact of the State's technology is also assessed. Since the gain to one 
side is the other side's loss, this conflict is a zero sum game as opposed to the constant 
sum games seen up to now (they revolve on the division of a valuable item). 24 ® 

Letting denote Q the monetary amount at disposal of the government, the relation to 
the disposable income of recipients S and the tax-payers contribution T is respectively 
S = F 2 {Q) and Q = -Fi(T) where both functions display decreasing returns to scale (like a 
monopoly's marginal revenue) because the tax collection and distribution systems gener- 
ate deadweight costs (cf marginal cost of public funds in §17.1.2). Pressure determines 
the public decision through the influence function Q = Y{k t , k s ) which displays decreasing 
returns to pressure i.e., Y ? -; < < Y t for i = t,s. As the sign of Y ts cannot be determined 

k 2 k 2 

from intuition, we consider Y{k t , k s ) = k s - -f - k t + -j- + yk s k t with 7 small but of no partic- 
ular sign. We let also FiCT) = B t T with 6 t < 1 and F 2 {Q) = Q/d s with 6 S > 1. This particular 
labeling choice simplifies the derivation of the equilibrium but one must take care that a 
higher 9 S worsens the distribution technology while a higher 6 t improves it. The utilities 
are then 

ln t = -T-k t = -^-k t ^ FOC \o t = -r t = i-k t - T k s 

I n s = S-k s = ^j^-k s I 6 s = Y s = l-k s + T k t 



best reply: { ' => equilibrium: < 

" ' ■■1-0 S + Yk t 



, , _ l-6> f -y(l-6> s ) 

K t ~ l+ y 2 

, , * _ l-6> s +y(l-8 f ) 

K S ~ l +J 2 




Notice the "reverse slope" property of the best-replies which is due to the zero-sum nature 
of the game at hand. The equilibrium transfer is then Q* = -^ (7 (1 - 6 s 6 t ) - {0 S - 8 t ) ^^A 
while indirect utilities are 

oc 6 2 s -2 + 6 t {2-6 t )-2y6 s {l-6 t ) 
oc e 2 t -2 + 6 s {2-6 s )-2y6 t {6 s -l) 

If recipients reduce their internal free riding, they are able to apply pressure k s at 
lower cost, as if n s = S - Xk s for A < 1 but this is equivalent to set n s = jQ-Y{k t , k s ) - k s as if 
6 S was smaller. The effect is to increase own pressure k* at the rate j^ and decrease 
that of opponents at the much smaller rate j£s> The government is thus pressured 
more and the outcome changes in favor of recipients, though not much. The analysis is 
symmetric for taxpayers. 25@ Hence we may conclude that 



I The political effectiveness of a group is determined by its relative efficiency, not the 
absolute one. 

We know study how changes in the efficiency of the State machinery impact lobbies. 
A larger 6 parameter amounts to a worsening. We have -^- oc -6 s -6 t j < and -^- oc 
dt-dsj > i.e., both a greater distribution efficiency and taxation efficiency increase 
the final transfer. This means that the large redistribution programs we observe are 
in fact the " cheapest" to implement. In other words, a higher marginal deadweight 
cost (worse State machinery on either side) reduces the size of the program. This is 
because recipients have little incentive to make costly political investments in support 
of redistribution when it is carried out by highly distorting policy instruments while 
taxpayers, on the contrary, have a strong incentive to oppose them. This does not mean 
that everybody wants to scrap a program based on an inefficient transfer technology. 

Regarding this issue, political competition tends to promote efficient means of taxa- 
tion because both groups prefer a program with low deadweight cost of taxation. That 
taxpayers prefer an efficient taxation system is clear (^ oc 1-6 1 +yd s > 0) but since there 
is complementarity, recipients like it too because the low effort of taxpayers enables them 
to invest little (^ oc Q t -j{Q s -l) > for small y). There is no such clear cut conclusion 
with the distribution system. Not surprinsingly, recipients are in favor of an efficient 
system (§^oc l-Q s -j8 t <Q) but taxpayers will oppose it because it attracts large politi- 
cal investments from recipients, thus forcing them to respond accordingly with a larger 
investment (^ oc 6 s -y{l-d t ) > for small y). This may explain why the tax system 
appear rather efficient (at least in advanced countries) whereas subsidies and other re- 
distribution program are almost always seen as money "thrown by the window". 

Note finally, that the general inefficiency of rent-seeking for political favors derives 
from the partial equilibrium assumption that any investment into lobbying is a waste. 
Whenever we cease to view the State as an inert black box but rather as a monopolist 
who auctions favors, efficiency is restored (cf Martimort and Stole (2003) on common 
agency). 26 ® 

7.3.2 Capture and Pressure 

In most countries, overtly corrupt politicians are defeated by honest ones. Dal Bo and 
Di Telia (2003) show however that threats and smear campaigns applied by lobbies may 
capture them and produce similar outcomes, though for quite different reasons. As con- 
sequence, honest politicians must endure negative pressure and earn less than their 
market equivalent wage which is conducive of entry of untalented people into the activ- 
ity. 



Consider then an honest politician who must choose between policies supported by 
lobbies A and B. It is easier for exposition to associate A with efficiency (i.e., supported 
by the majority of the population) and B with one special interest group (i.e., supported 
by a minority). Choosing policy A over B has two consequences for the politician. On 
the one side, she will gain electoral support and increase her probability of remaining 
in power but on the other side, she will have to endure the offensive pressure of the 
disgruntled lobby B. At this point, lobby A will apply defensive pressure to counter-act 
B's influence. 27@ Letting a be the pressure applied by group i when policy A is chosen 
for i = A,B, the net offensive pressure endured by the politician is p A = e B - e A - In this 
simplistic model, p B is normalized to zero since lobby B is happy with policy B whereas 
lobby A cannot overturn it. 

The lottery (contest success function) is then introduced through a macro-economic 
shock 9 > that affects negatively the economy before the politician is required to make 
a decision. For policy i = A,B, the probability of losing the next election is the increasing 
function Gi{0) i.e., it is harder to win an election in times of depression. The electoral 
support gathered with implementing policy A over B means that the wedge G B - Ga is 
positive; as attested by empirical studies, we can further assume it is increasing with 
the size of the shock i.e., in crisis times, there is strong electoral support for reform 
(choosing the efficient policy over the "business as usual" one). 

Normalizing the utility of remaining in power to unity, the expected utility of the 
politician is Ui{8) = 1 - G ; (0) - p l for choice i = A,B (assuming net pressure to be positive). 
We have 

u A {6) > u B m o G B {6) - G A {6) >p A o6> 6(e B - e A ) 

where is the increasing inverse of G b -Ga- This means that the politician implements 
the efficient policy only after a large enough shock. The probability of a small shock is 
then the success probability of lobby B; its profit is thus n B = v B Vr{0 < 6{e B - e^s) - \k B e\ 
where X B is the inverse productivity of the pressure technology. A symmetric formula 
holds for 7i a- 

In the absence of offensive pressure from lobby B (when policy A is chosen), lobby A 
enjoys the maximum probability of success, thus it does not engage into defensive pres- 
sure. Clearly, given this nil defensive pressure, it pays to be offensive for B since is 
increasing and we assume decreasing returns to scale in the production of negative in- 
fluence. As in the Hotelling model, the defensive side will respond with some counter 
pressure. 28 ® An equilibrium is found because both pressure technologies displays de- 
creasing returns to scale. 29 ® 

Since both sides invest into pressure, there will be a positive amount of negative 
pressure in equilibrium and thus the efficient policy is delayed whenever the shock is 



small because the honest politician fears retaliation from the lobby. However, in times 
of crisis, policy changes hurting incumbent lobbies are realized although they strongly 
oppose them. Note also that the politician being subject to pressure, he earns less than 
in a world safe of smear attacks. This inability of public office to pay adequately its 
servants may explain the failure to attract the most productive people (in a classical 
sense) and why those with special talent for influence end-up there. 

7.3.3 Collective Action 

Individuals rarely engage into lobbying or rent-seeking for their own sake, rather they do 
it on behalf of the group they belong to or represent (cf §2.4.4). Olson (1965) argues that 
small groups are more effective because their members have greater incentives to invest 
effort into the group activities. Indeed, the per-capita stakes are higher and shirking has 
more deleterious consequences on the group's chances of success. Hence, group members 
exert a greater effort and the group succeeds more often than a comparatively larger one. 
It has been argued that this free rider reasoning is only correct if the prize has a private 
nature (i.e., to be divided among group members) whereas if the prize is enjoyed as a 
public good by members, then larger group produce a greater amount of effort. Esteban 
and Ray (2001) address this wisdom in a setting where group benefit are equally divided 
among members. 

Let etj and V{j respectively denote effort and valuation of the rent by member ;' of 
group i. The size of group i is m,-, its aggregate valuation for the prize is vi = Ej< m , Vi,j 
and its total effort is e ; - = L/< m; etj. Lastly, e = T,i< n e i denotes total effort by contending 
groups. The probability that group i wins is taken to be pi = j. It is clear that in 
a small group, etj is an important share of et, thus an individual contribution hinges 
on the success probability. By setting the individual payoff to be their fair share vtj = 
^t:, we ascribe high per-capita stakes to small groups and also maximize the free rider 
problem since individuals get a poor share of the returns on their efforts. The expected 
utility is Jij = piv it j - c{ej) so that the FOC of optimal effort is -^- = 0o ^~§ Lv i - e< t~; 1 - ^ n 
equilibrium, all members of a group perform identically thus eij = — and the condition 
becomes e"~ l e 2 = (e-edVjjnf' 1 . We assume next that groups are of equal size m and that 
the prize is commonly valued (by everyone) at v . From e = net, we obtain e" = l ^vm a ~ x , 
hence e? . . = -^a and total cost is nm^- = ^p v which is less than the prize value to a single 
individual. 

When the prize is fully private such as a protective quota or a monopoly rent, its social 
value W is to be divided among the members of the winning group, thus v = ^. As soon 
as a group represents dozen of members as in the typical trade or industry association, 



the rent dissipation is limited to a few percentage points. If, on the contrary, the prize 
has a public dimension for the group as in the case of a protective law, the location of a 
public facility or the specific characteristics of a public project, then v is how much each 
individual winner values the public policy toward himself. The aggregate value for the 
group is thus mv and quite often the aggregate value for other groups is positive since 
the project also serves them, although not as they originally wished. The social value W 
is thus again in excess of mv and the previous conclusion continues to hold. 

7.3.4 Optimal size of the Firm 

Muller and Warneryd (2001) offer a rent-seeking explanation for the observation that 
partnerships never grow too big i.e., go public above some revenue threshold. A recur- 
rent problem for a partnership is free riding among the partners as each is tempted to 
appropriate more than his fair share of the firm's profits through accounting fraud and 
other siphoning techniques (cf §13). If the firm goes public, the managers still tempted 
to appropriate the firm's wealth will have to go through two stages of bargaining, first 
as the board of directors against the board of shareholders and then among themselves; 
because the prize to be divided among them is already smaller than under partnership, 
their rent-seeking incentives are diluted. Although owners also dissipate resources on 
their side, it is enough that they be less numerous than managers for total dissipation 
to be reduced by the incorporation. 

Let us illustrate these claims formally. If the firm's profit is a prize v to be divided 
among the m partners, their fair share is j-. In the lottery model of rent-seeking, each 
partner burns m= r- v in influence activity, his expected return is only -^ while the index 
of wealth dissipation can be written as /3 = m =^. If the firm goes public and end-ups with 
n shareholders (or outsiders); former partners are now managers (or insiders) and must 
compete against the outsiders for the prize before competing among themselves to divide 
whatever they got. 

Let tt (resp. St) denote the effort of an outsider (resp. insider) and T = Y.j tj (resp. 
S = E; rrij) the aggregate effort of the group that defines the share -^ (resp. j^) of the 
prize accruing to that group. An outsider (resp. insider) expects a share \ (resp. -\) 
of the prize won by his group, thus his overall expected payoff is u>i = \-^pf - ti (resp. 
u k = -^2^f- Sk)- The FOC system is 



i _ du k _ n _ dwi _ v S _ i _ „ _ n 2 {S+T) 2 _ m 2 {S+T) 



m 2 [S+T) 2 A ~ dm k ~ u _ dt t ~ n 2 {S+T) 2 x v ~ S T 



hence the solution is T = - ™ v — 2 and S = - " " 2 (with S+T = -^-^ and ^r = -£-?)• 

{m 2 + n 2 ) 2 {m 2 + n 2 y m 2 + n 2 S+T m 2 + n 2 ' 

The wealth dissipation sums for both insiders and outsiders what they spent in the two 



stages i.e., 

R — S 4. m—1 S , T , n-\ T 
V~v^ m S+T^v^ n S+T 

so that 

ft _ a _ 1 , T l n-\ _ m-\ \ _ n(m+l]-m 2 
" P~ m 2 + n 2 S+T v n m > (m 2 + n 2 )n 

thus it is basically enough to have less outside owners (n) than inside managers (m) for 
dissipation to shrink with incorporation. 

7.3.5 Coase Theorem 

In §8.1.3, we recall the conditions under which a laissez-faire policy can be efficient to 
solve entitlements problems. We show here that this conclusion must be amended when 
we take into account the likely conflict over the allocation of property rights and the 
possible ambiguities in this process. 

Transaction Costs 

In an ideal world, items have well defined property rights and no transaction cost hinders 
exchange so that the final owner of a contentious item is the one with the highest WTP for 
it, no matter who initially owned it. Yet, as final payoffs depend on the threat point i.e., 
the initial ownership, we can expect parties to effort themselves to sway the awarding in 
their favor. 

For instance, the owner of a patent for extracting cheaply tar sand will probably never 
use it because he will be bribed enormously by oil extractors from all over the world to 
keep it in the drawer. He will thus act (produce or consume) as if the patent did not 
exist and full efficiency for the energy industry will be achieved (they avoid a destructive 
competition). Yet the profit made from threatening to use the patent is a very strong 
incentive to acquire it in the first place. Likewise, the final days of a war before parties 
sign the armistice are the most ferocious since each army tries desperately to conquer 
additional land before the cease fire enters into force. The initial allocation of property 
rights should therefore be pressure-proof to discourage socially wasteful lobbying. 

Robson and Skaperdas (2008) formalize this issue in a simple partial equilibrium 
framework and show that reducing transaction cost need not always improve efficiency (a 
Laffer curve result). If S (seller) and B (buyer) contend for the property of an item they 
value respectively c and v > c, then efficiency calls for B to be the final owner. Upon 
expanding influence investments ks and kg, B is awarded ownership with probability 
Pb - Y~+k * n which case he keeps the item. With probability p s = l- p B , S gets the item 
and sell it to B for the price ^ (cf §2.4.3). Transaction costs are introduced by assuming 



that trade fails with probability a < 1. Profits are thus 



n B = PBV + pstt-rt^-ks =p B V B -k B + (l-a) R 2 £ 
n s = psitt-rt^ + acj-ks =PsV s -k s 



where V B = v - (1 - a)^- and V s = (1 - a) ^ + ac satisfy V B + V S = v + c. Applying (7.4), 
the equilibrium influence investments are k* = ^^ 2 for i = S,B while p* B = v v ^ v . The 
welfare loss is found by summing transaction costs (the efficiency loss of erroneous final 
ownership) to rent dissipation i.e., 

/8(g) ^ pldv - c) + k B + k* s = yfar (a(v - c) + V B ) = [^giggg) (7 . 12) 

where w= v + c and o = v-c denote total and differential value of the item for the parties. 

This function of the trade impediment a is bell shaped with a maximum for a = g|. 
Minimum welfare loss is thus achieved either when there are no transaction cost (a = 0) 
or infinitely many so (a = 1). When trade is sure to take place (a - 0), parties become 
symmetric (V B = Vs-f) and thus invest heavily; however this is the only source of wel- 
fare loss. When trade is almost impossible (a - 1), the item becomes more valuable for 
the buyer than for the seller ( V B = v,V s - c), hence their lobbying efforts become different 
(cf §7.2.2) and are on aggregate lower; rent-seeking has thus allocated the item rather 
efficiently. However missed trade opportunities must be added to the welfare loss. 

To conclude, we might say that for a low transaction cost (a - 0), reducing it further 
is welfare improving but when trade impediment is of medium importance J| > a > g|, 
a reduction of transaction cost actually worsens the overall level of efficiency of the ex- 
change process because the gain in recovered trade opportunities is more than offset by 
the increased rent dissipation in lobbying. 

Ambiguous Property Rights 

Ambiguity in the allocation of property rights among two parties by the court is another 
source of transaction cost. One way to model this issue is to assume that after investing 
k( and kj, party i wins ownership with probability pi - x .^{. k . where A ; indicates the 
weight of legal precedent in his favor i.e., At = 0, 1 corresponds to certainty (no ambiguity) 
whereas ^i = \ indicates maximum uncertainty (and we are back into the pure lottery 
model). 30@ 

The prize for firm i in this context is the profit difference V t between winning and 
losing the allocation of property rights. 31@ Up to a constant, each party maximizes ni = 
PiVt - ki for i = A,B. Making again the change of strategic variable, from investment ki 



to leverage et = A/fcj, we observe that party i maximizes j^-A/Vi - et, thus we can apply 

(7.4) to deduce that in equilibrium k* = - ' ' ' \ 2 and p* = . „ { jj v . 

We therefore conclude that the party with more at stake invests more (i.e., V; > V) => 
A;* > A;*). Notice further that if she enjoys legal bias is in her favor (A; > 1/2) then upon in- 

' J 

creasing this bias (A, /), both investments fall so that a greater efficiency is achieved 32 ® 
so that we can claim: 33@ 

| It is efficient to assign property rights to the party with more at stake. 

Since the party awarded exclusive property rights will often use them inefficiently 
(because he/she ignores the externality produced on the loser), there is room for either 
an ex-ante or ex-post negotiation towards an agreement where the property rights are 
put to joint use. Let W* be maximum joint profit. 34 ® In our complete information setting, 
the agreement will be reached immediately (ex-ante) but under the menace of trial. Now, 
since investments affect the trial outcome, their equilibrium level will be positive. 

Indeed, if the parties fail to agree and go to trial, each expects p(Vi. Let 6 = W* - 
PiVi - pjVj the benefit of cooperation. Ex-ante, expected payoffs are m - § + ptVt - kt oc 
W* - pjVj + piVi -2k t oc Y-fc'+iV J ~ 2k i- Tne FOC are t nus symmetric and so is the equi- 

' ' J J 

librium with fc ; = kj = k = X i Xj- L Y ± (and p = \). Note that, with or without renegotiation, 
investments depend positively on A ; A ; -, thus wasteful influence always increases with 
ambiguity (A; — \). 

It is immediate to see that when V; > V,-, k* + k* <2ko A?Vt > A 2 F,-. The condition holds 

X V' 

if the ratio of precedent weights j- is large enough when compared to ratio of values -^ 

J ' 

(which is lesser than unity). Thus, if a party has more at stake and is advantaged by 
law, she invests so much in influence that the aggregate investments are greater than if 
Pareto-efficient negotiation was not available. Finally, if, on top of the previous condition, 
the overall inefficiency is not too large (i.e., 5 small) then we may have n* = p* Vt-k* > Hi = 
I + y- k i.e., a party prefers to go to trial (and stick to an inefficient outcome) rather than 
agree to an efficient use of the disputed item. This optimal behavior thus rationalizes 
some behavior observed in real life. 



7.4 Patent Races and Attrition 

This section considers dynamic conflicts. 



7.4.1 Patent Race 

In §12, we treat research and development (R&D) as a strategic investment aimed at 
improving the competitive position of the firm. In the present section, we study "patent 
races" whereby firms invest in R&D in order to make a valuable discovery and be the 
first to get a patent protection. We demonstrate a close equivalence with the standard 
model of rent-seeking although the conclusions derived from it are less dramatic because 
the patented discovery has a social value. 

Let us assume that all firms seek the same discovery, that the market value v of 
the patent is fixed and known to all and that it starts to accrue to the patent holder 
as soon as the discovery is made (no delay in the processing of the patent application). 
Loury (1979) shows that the instantaneous probability of making the discovery is very 
much like the winning probability seen in the contest technology. The expected benefit 
of investing is thus similar to the rent-seeking profit up to the presence of the interest 
rate that account for the dynamic nature of patent races. 

We proceed to prove this claim. For each contender, the time of discovery t is a random 
variable with law Pr(r < t) = G{t) and density g{t) = G'{t). A fundamental concept is the 
hazard rate jr^j which is the probability that the innovation is ready at time t + dt 
knowing it was not yet ready at time t. The model assumes this hazard rate to be time 
independent and a function h{k) of investement A;; thus Pr(r < t ) = 1 - e~ h{k]t and the 
density is Pr(r = r) = h{x)e- h[x]t . 35@ 

Let us denote z\ = h{kO,z-i = 'Ljjuhikj), t ; - the discovery time of firm i and t_; the 
discovery time of the earliest other firm. If t_j = t, firm i can enjoy the prize whenever 
Tj ; = s < t. Letting r be the interest rate, the conditional expected benefit is then 36@ 

mlt) = v fpr(Ti = s)e- rs ds=v C z^e-™ &s= v^{l- e~ [r+Zi)t ) 

*/ </ 

Assuming that the random processes governing firms discoveries are independent, we 
have 

Pr [r-i <t) = 1-Pr(r/ > t,Mj £ i) = 1- n e~ z ' t = 1 - e~ z - il 

so that Pr (t_; = t) = Z-ie~ z - lt . The unconditional expected benefit is 

+oo (+oo +oo \ 

n, = / z-te-^nMdt = v^fl f e- z -' l dt- f e Hr+Zi+z -' ]t dt\ 
o ' v o o / 

= V T^i (i - r+zl+z-i) = r+i+z^ V ( 7 - 13) 

so that the probability of winning the patent race is the contest formula (7.2) up to the 



interest rate r. 

7.4.2 Attrition 

A war of attrition is a battle among firms to control a market, a standard or a new 
technology; it ends after all but one contender accept defeat rather than at a fixed time 
or when a player is able to claim victory as in a patent race (cf. ch. 7 on rivalry). It 
can explain an Industry Shakeout, a phase of rapid decline in the number of producers 
and simultaneous expansion of aggregate output. Indeed, once contenders start to drop 
out, the remaining incumbents serve larger market shares and can thus take advantage 
of scale economies to become even more competitive and expel some more firms. The 
market size also grows because the reduced variety of products for sale increases the 
potential life cycle of the product, reduces the uncertainty over its core characteristics. 

A recent example is the portable game console market where Nintendo remained 
in 2000 the sole vendor after battling its competitors for 11 years (Atari, Sega, Tiger 
Electronics, NeoGeo) and selling more than 120 million units. Nintendo nevertheless 
faces new threats; Nokia in 2004 and Sony in 2005 launched portable game consoles. 
Lately Apple's iPod Touch has proven an unexpected comptetitor. 

The adequate game theoretical framework to analyze this kind of fight is the duel 
explained below. 

Value of Staying Modeling this struggle can lead to a peculiar equilibrium where one 
firm enters the market and the other one stays out. The obvious problem is to identify 
the winner. A more meaningful equilibrium is one where the two firms follow the same 
pattern of action which is a probability a of exit at each period of trade. The winner's 
prize is the monopoly profit n (sum of discounted future profits) while the loss is the 
fixed (sunk) cost F of developing the product. The value of staying in the market V is the 
sum of two expected terms, one for the case where the firm becomes a monopoly because 
the contender exited and one for the case where duopoly goes on for one more period. 
The first term is an while the second term is the discounted value today of staying in 
the market tomorrow, -^ where r is the interest rate of the firm's owner. Summing 
and removing the fixed cost, we obtain V - all + [1 - a)^ - F, thus V = (gn ~^ 1+r) . In 
equilibrium, each firm is indifferent between stay and exit 37@ and since the value of exit 
is zero, V must be nil; we therefore deduce that the probability of exiting the race is 
a = yi which is independent of the discount factor; what matters are sunk cost and future 
profits. 



Duel A good illustration of predation is the competition between Sony and Nintendo 
in the development of a new game console. The longer a firm spends on development, 
the better its product but the first to release has an advantage since its customers will 
remain captive (lock-in due to switching cost as studied in §24.2.3). 

A firm that releases its product first, at time t, captures the market share h{t). The 
remaining market share is left for the other firm. If they release at the same time, the 
total market is shared evenly. The profit function is thus 



7ti{ti, t 2 ) = < 



h{ti)-rt\ if t\ < t2 

\-rt x if h = t 2 

l-h(t 2 )-rti if h>t 2 



f+ 


if 


t 2 <t 




t 2 


if 


t 2 = t 


; very much like in the Bertrand 


t- 


if 


t 2 > t 





where h increases from h{0) = to h{T) = 1 and r is the interest rate (time is costly). Let 
t be such that h{i) = \. 

When f 2 < i (quick opponent), a choice t\ < t 2 implies h{t\) < h(t 2 ) < \ while t\ > t 2 yields 
a share l-h{t2) > |. Thus t\ = t^ (slightly more than t 2 ) is the best reply (r makes the firm 
impatient). When f 2 > i then t\ > t2 => 1 - h{t 2 ) < \ while t^ (slightly less than t2) yields 
/z(£p > \; it is therefore the best reply. Against t there is no need to rush or to wait, hence 
the best reply is also t. 

The best reply function is BR1H2) = < 

paradox. We see immediately that that no time greater than t can appear in equilibrium 
because it would be undercut. Likewise a lesser time would be "overpriced" by the oppo- 
nent so that both timing would increase. It is only against t that t is the best reply; the 
equilibrium is therefore the symmetric choice t that generates equal market shares. 

7.4.3 Performance based Compensation 

We compare here two related forms of contest, relative performance evaluation (RPE) 
and joint performance evaluation (JPE) which are discussed at more length in §13.1.2. 

Static Model 

A principal hires two agents to work in her business. Her control technology being im- 
perfect, she can either receive a common signal that everybody is working well, with 
probability a, or a differentiated signal for each agent (with complementary probability). 
The personal signal of an agent is good with probability qt where A; = (shirk) or 1 (work) 
is the previously chosen effort. We assume q\ > qo i.e., there is a positive correlation 



between effort and signal. Notice that agents' production technologies are independent. 
Wage can be contingent on signals but must be positive because no financial penalties 
are allowed by the law governing labor relationships. Given the auditing technology, a 
wage scheme for a worker is a vector w = {wn, u>\q, u>q\, wqq) where w uv > is the wage if 
the own signal is u while the colleague's one is v . The expected wage for agent i when 
efforts are respectively k and / is thus 

n k i{w)= {a + {l-a)q k qi)wn + {l-a)q k {\- qi)w w 

+ (1 - a)qi(l - q k ) u>o\ +(1 - er)(l - qi){l - q k )w o 

When the principal receives a good signal for worker j, she can either pay more to 
worker i to reward the team (JPE) or less to reward the best of the two agents (RPE) or 
change nothing. Analytically JPE is to set {Wu,Wq\) > {wio,wqo) while adopting a RPE 
amount to choose wages so as to reverse the inequality. Notice that JPE generates a 
positive externality since n k ±{w) > n k0 {w) for k = 0, 1 while RPE generates a negative one. 
Letting the cost of effort be c, a contract w induces effort for both agents as a Nash 
equilibrium if nn(w) -c> n iiw) o 

qiiwn - wqi) + (1 - qiKwio - w o) (7.14) 



{l-a){qi-qo) 



The aim of the principal is to minimize the wages while motivating hard work, thus 
she aims at minimizing n\\[w) under the incentive constraint (7.14). It is clear that 
setting wq\ = w 00 = will be optimal. Then (7.14) simplifies to {l _ a) [ q _ q \ ^ q\W\\ + (1 - 
qi) w\o and the expected utility becomes 

n u (w) = (1 - a)qi {q x w u + (1 - q x ) w w ) + aw u = L1 _ a) [ qi _ qo) + o w u 

so that setting W\\ - is also optimal. Minimizing wage commands to solve (7.14) with 
equality; we thus obtain then w s 1Q = a _ anq ^ qo]a _ qiy 38@ 

The optimal contract is therefore an extreme form of RPE with iv s = (0, u;^ ,0,0). The 
key to this result is the presence of noise (a): a good signal is more informative of 
effort if the partner's signal is bad. Thus, paying a bonus solely in that asymmetrical 
situation (which is the essence of RPE) generates a cheap motivation to work. 

Dynamic Model 

Consider now a dynamic setting of infinite repetition. The discount factor of agents is 
6 < 1 (alternatively 1-6 could be the probability of going bankrupt for the principal). 



The wage scheme chosen initially applies to all subsequent periods. We study the imple- 
mentation of effort for both agents. By shirking today, an agent gets n i(w) today and 
since the other agent can change his effort in the future (possible retaliation), he can 
guarantee himself at least mm{n o{w),noi(w)} in each future period. Under JPE where 
Ttooiw) < Ttoiiw) must hold, the incentive constraint for hard-work today, given that the 
other agent is working, is 

7tii[w) -c> {l-6)n i{w) +S7i 00 {w) (7.15) 

which is strictly slacker than (7.14). On the contrary, in a RPE scheme n 00 {w) > ^oi(^) 
must hold and this gives rise to an incentive constraint which is simplifies into the static 
(7.14). This observation means that JPE gives more possibilities to punish a shirker in 
a dynamic setting because the other agent can retaliate by shirking also in the future. 

Let us study the minimum of nuiw) under the incentive constraint (7.15) and the use 
of a JPE scheme. The latter reads 

(q 1 + 5q ){w n - w w - w i - w 00 ) + Wi + 6w i - (1 + 5) w 00 



(l-a){qi-qo) 



thus we can set w 00 = (negative in (7.15) while positive in objective) and w i = (same 
weight as w\q in objective, smaller in (7.15)). Since both the objective and the con- 
straint are linear in w\\ and w\q, only one can be positive, but given the restriction 
to use a JPE scheme it must be w\\. Hence the solution is w J = (w[ v 0,0,0) where 



w J = 



11 (l-a){qi-q ){qi+Sq ) w 

If we use a RPE scheme, we already know that the optimum is iu s ; comparing the 
final cost of both schemes shows that nniw) 1 < Jin{w s ) o 5 > — — f, _„. , hence JPE domi- 
nates RPE when the relationship is long lasting (or agents are patient). 

It remains to show that retaliation to shirking after observing shirking once is a 
credible behavior. Since (7.15) is binding at w J , 8 < 1 and noilw 1 ) > n Q0 {w J ), it must be 
true that n\\{w J ) -c< tto\{w j ) meaning that shirking is optimal when facing work. From 
equality in (7.15), we deduce 

TI\\{W) -TIqi{W J ) =6{n 00 {w) -noi(w)) + c<c 

Lasltly, noticing that 

7Tn(ii) ; )+7roo(^ 7 )-^io(^' 7 )-^oi(^ / ) = (l-o-)(<7i-go) 2 Wn>0 
we obtain u\q{w j ) -nooiw 1 ) < n\\{w J ) -noiiw 1 ) < c i.e., ni {w J ) - c < 7i 00 {w J ) meaning that 



(shirk,shirk) is a Nash equilibrium of the one stage game. This proves that under w 1 , 
the retaliation payoff for the future we considered in (7.15) was correct. Furthermore, 
w J is collusion-proof in the sense that the two agents get more by working than if they 
adopt any other pattern of behavior i.e., nu{w J )-c> n 00 {w J ) and 2{nn{w J )-c) >mo{w J )- 

C + 7Tqi{W J ). 

Our theoretical study has thus shown that RPE is better suited to static or short term 
relations and JPE to long lasting ones absent any consideration of positive externality 
among agents. 



PartD 



Antitrust Issues 



Chapter 8 

Legal Framework 



Market competition, contracts, vertical agreements and more generally all relationships 
between firms do not take place in the limbo, they are in fact mediated by a large legal 
apparatus guaranteeing each participant, he shan't be coerced into the exchange and 
that promises shall be kept. That is to say, the State machinery (justice, police) protects 
private property and enforces contracts smoothly. 

Except when stated otherwise, we assume in this book that the rule of law is indeed 
the reference situation for all economic agents. The object of this chapter is then to make 
sense of this framework. In the first (still sketchy) section, we work out the conditions 
and the limits to this ideal. The next sections present the current legal framework of the 
European and US economies. Due to their differing pace of economic and political devel- 
opment, we emphasize the political construction of Europe and the antitrust genesis in 
the US. 

8.1 Rule of Law 

Advanced economies live under the rule of law, by which we roughly mean that citizens 
and firms enjoy effective rights such as freedom (of speech, of religion,...) and political 
participation. Economic theory then projects this reality into a market nirvana whereby 
exhaustive property rights exist and are swiftly enforced by a costless, error-free judicial 
system working in tandem with a serviceable police. Obviously, such an idealized world 
rests on a State strong enough to uphold our political and economic rights. Without 
losing this proviso from sight, we place ourselves under the realm of the law and disre- 
gard any form of coercion. We can thus keep our adherence to individual rationality i.e., 
economic agents freely enter into contracts over items that have recognizable property 
rights. 

In accordance with the contract-based approach to economic relations taken in this 
book, the transposition of "political freedom" for citizens to economic agents (firms, en- 



trepreneurs) revolves around "property rights". Indeed, firms can be effective economic 
actors only if they can trust each other and do not fear opportunistic or fraudulent be- 
haviors. To guarantee such an environment, the State (cf §16.1.1) deploys a legal frame- 
work leaning on two pillars, litigation and regulation whose respective enforcers are the 
judicial system (justice and police) and the bureaucracy, itself encompassing ministries, 
public commissions and agencies (cf. §13.1.3 on bureaucracy). 

8.1.1 Property Rights 

Property rights are held either privately (individual, firm, very small number of people), 
communally (large group of people) or publicly (State) over items such as physical or 
financial assets, natural resources or knowledge. 1@ Following Grafton et al. (2004), we 
may identify several characteristics delineating property rights over an item: 

Exclusivity Ability to exclude others from either using or benefiting from a flow of 
benefits originating with the item. 

Transferability Ability to transfer or alienate, at will, the item or its flow of benefits. 

Duration Time over which the right remains in existence. 

Enforceability Extent to which the right is recognized in law (e.g., certificate of owner- 
ship). It measures how much protection is available from encroachment i.e., relates 
to the implementation of exclusivity. 

Divisibility Ability of the holder of the right to divide up the asset or the flow of benefits 
from the asset. 

Flexibility Limitations and obligations over the use of the rights not covered by the 
other characteristics. 2@ 

Under these conditions, Adam Smith's invisible hand (e.g.g, greed) pushes owners to 
perform profitable exchanges and in the end, items end up owned by those most able to 
put them at a productive use. Demsetz (1967) observes that property rights represent a 
social institution that creates incentives to efficiently use assets and further to maintain 
and invest in them. Their enforcement come at the hand of 

• courts (judicial) 

• administrative agencies (bureaucracy) 

• customs and norms (requires trading within kinship or brotherhood) 

• repeated market interaction (reputation building) 

• coercion (ultimate tool to force an individual to behave) 



Property rules form the legal basis for voluntary (market) exchange of rights because 
a right holder has the exclusive use of the asset and can exclude infringement. In con- 
trast, liability rules only award damages for infringement and thus form the basis for 
court-ordered non-consensual transactions. If transaction costs are low, many efficient 
transactions are performed under property rules. If these cost rise then property rules 
create a market failure and a change to liability rules may become advisable as it al- 
lows more efficiency enhancing transactions where the price is decided by the judge. The 
downside is the errors made by the judge that may leave one party worse off. In gen- 
eral, liability rules cannot create efficient long run incentives because of the constraint 
that what one party pays the other must receive cannot give both sides the adequate 
incentives. 

Origins of the Law 

Hobbes (1651)'s gloomy vision is an important reference in economics. The author claims 
that in ancient times, humanity lived in a "state of nature" free from ruler or government 
but also without rules governing ownership of scarce resources. Competing claims over 
these surely resulted in the widespread use of violence. Society was plunged into a war 
of all against all where the resulting life would be "nasty, brutish, and short." Although 
this recollection may at first look seem convincing, there is scant historical evidence of 
this "state of nature". On the contrary, it is a well accepted fact that our ancestors lived 
in small bands whose social coherence depended to a considerable extent upon inherited 
behavior patterns. 3 ® 

North et al. (2006) propose a framework based on economic reasoning for understand- 
ing the evolution of humanity. In a nutshell, human societies evolved from primitive to 
feudal. Then, most transformed into nation-states and among those, some successfully 
moved to free-market-democracies. Internal and external violence is ever present in this 
evolution but as countries grew richer, the State apparatus succeeded to monopolize the 
use of violence and thereby reduce internal strife and foster growth (cf §16.1.1). Yet, 
most facets of social life remained tightly controlled by the State. In the ultimate tran- 
sition, general freedoms are bestowed by the State upon the entire population leading 
to unprecedented levels of innovation, growth, material wealth and satisfaction for the 
citizens (longer and better life) as attested by Usher (2003). 

Wallis (2011) argues that one of the crucial change in human relationships brought 
about by the rule of law is that people and firms now hold rights whereas they used to 
enjoy privileges. A right is anonymous and can usually be transferred (sold) whereas a 
privilege is intimately linked to its receiver and thus cannot be passed, except to heirs 
in some circumstances. Furthermore, the privilege is bestowed by a ruler or a powerful 



economic agent onto someone in payment for past service or in the expectation that the 
favor will be returned. A right, on the contrary, is a service that the community confers 
free of charge to all its members without discrimination; it is universal whereas the 
privilege is particular. 

Acemoglu (2010) adds political struggles and social conflict to complete this panorama 
and explains in a convincing and intuitive manner why politically oppressive and eco- 
nomically inefficient regimes are so enduring (cf. §8.1.3). 

Emergence of Private Property A pillar of modern legal systems is private property 
which often emerged prior to the establishment of the law (cf. Gintis (2007)). This 
phenomenon can be rationalized as an equilibrium behavior in game theoretic models of 
rivalry or conflict over ownership of an item (typically land) between an incumbent and 
a challenger. 

The endowment effect or loss aversion states that one is ready to sacrifice more to 
defend a holding than to acquire it. There are many reasons why it holds. Firstly, the 
holder knows better the item than a potential acquirer who thus needs to apply a risk 
premium when estimating the item's value. Next, the holder may have invested into 
the asset so that its value has increased for him wrt. its original market price (cf. asset 
specificity). 4@ Lastly, there is a behavioral explanation, empirically validated according 
to which the incumbent can be identified as "rightful" for the very fact of being first 
holder. This observable feature serves as a signal which helps society minimize conflict 
over ownership. 5@ 

The transcription of these facets of loss aversion into models of asymmetric conflicts 
are asymmetric valuation (the incumbent values the item more because of complemen- 
tary and specific investments), asymmetric burden (the incumbent is more effective at 
defending the item than the challenger at contesting it) and commitment (the incumbent 
has committed long-lived resources into defending his position). As we show in §7.2.2, all 
features point in the same direction, namely that the incumbent expends more resources 
than the challenger so that the former remains in place with a high probability i.e., there 
is a natural respect for private property. 

Limits to Law 

However strong the rule of the law may be, a rational self-interested individual will 
always strike a balance between lawful and unlawful means of acquiring wealth (or be- 
tween moral and wicked ones if his education makes him a perfectly law-abiding citizen). 
Because the technology for identifying and catching thieves is far from perfect and the 
death penalty cannot be used for minor offenses (what deterrence would be left then for 



major offenses?), the expected penalty from wrongful behavior is always lesser than the 
expected reward, especially for those who have nothing to lose. The prevalence of vio- 
lence and fraud even in the most advanced societies is proof of this fact. This grey area 
where agents seek to discover the limits of the law is treated in chapter 7. Let us only 
say, applying marginalize thinking, that the State will stop investing into the repressive 
machinery when at the margin, a dollar of taxed wealth fails to increase GDP by a dollar 
i.e., it is optimal to allow a small amount of unlawful activity. For instance, small tax 
frauds are not pursued by IRS inspectors. 

8.1.2 Regulation vs. Litigation 

The main theme of this paragraph is that litigation is optimal when the issues at stake 
are well defined and can be included into contracts. 6@ When market failures such as ex- 
ternalities or asymmetric information among parties are present, litigation loses its bite 
to compel firms to behave in a socially optimal manner. Although the bright-lines rules 
of regulation are inefficiently rigid (aka "red tape"), they may fare better to reduce the 
impact of socially wasteful conducts. This is why the State imposes so many quantitative 
regulations such as firm qualification, product certification, minimum quality standards 
or occupational licenses for individuals. Much of these compulsory informative certifi- 
cates are delivered by private intermediaries in the shadow of the State who makes sure 
that they are honest i.e., neither an impersonator of a true certifier nor a malevolent one. 

Contract Enforcement 

The two technologies used by the State to uphold contracts are litigation and regulation 
(cf §16.1.1). In this section, we describe them precisely and give some elements of com- 
parison using the similarity with the "make or buy" " quandary that figures prominently 
in §13.3.3 on vertical integration and transaction costs. 

A commonality between litigation and regulation is their mode of action since both 
are based on deterrence: whenever an economic agent is found guilty of infringing a legal 
rule or a private contract, the State can use its coercive power to penalize him (with a fine 
or a prison term). The rational agent will then abide by the rule if the expected penalty 
is larger than the gain from misbehaving, that is to say, if both the probability of being 
(identified, tried and) condemned and the effective fine are large enough (this strategic 
interaction is detailed by way of an example in §2.4.2 on sequential games). Litigation 
and regulation however differ in two important dimensions, namely the timing of the 
State intervention and its odds of success. The optimality of a mechanism vis-a-vis the 
other will then depend on how it fares on those two dimensions. 



Litigation is an ex-post mechanism because it is called to the rescue after the parties 
to a contract (sometimes an implicit one) have started to perform, precisely when one 
party believes the other has misbehaved. The main enforcer of litigation is the public 
judicial system but it does not preclude private arbitration to avoid the cost of actually 
going to court (cf §2.4.3) 7@ . If complete contracts could be written at no cost, then 
every single contingency would be covered so that economic transactions would develop 
smoothly without ever resorting to the judicial system. Indeed, no one would complain 
because no one would break the rules and this is so because everyone knows that such a 
behavior would be exposed and punished. As shown by Coase (1960), even the traditional 
sources of market failure, externalities and asymmetric information, could be dealt with 
successfully in that ideal framework (cf. next section). 

Regulation is an ex-ante mechanism whereby the State forces firms to comply with 
some rigid but clear rules that mostly aim at guaranteeing minimum quality standards. 
A specialized bureau (either public or private) delivers certificates of compliance and 
perform inspections to make sure that the rule is followed by all. Offenders can be fined 
or even jailed. Occupational licenses are one example that applies to individuals who 
want to enter a regulated profession. Because a regulation forces producers to invest 
into assets such as human capital or specialized machineries which are only imperfectly 
related to the quality of their output, there is a productive inefficiency for high quality 
firms. Indeed, for such firms, the regulation is a source of fixed cost (deemed "red tape") 
that fails to motivate a higher quality choice (wrt. what they would elicit in the absence 
of regulation). In a world of complete contracting, regulation is worse than litigation but 
as we shall now demonstrate, regulation can fare better in a world where contracts are 
necessarily incomplete. 

Comparison 

A first weakness of litigation as a technology of legal enforcement is its proclivity to suffer 
demand shocks. Indeed, two large firms can trade millions of units in a single transac- 
tion; if a problem occurs, a single judge is called to decide. 8@ On the contrary, when a 
firm sells goods or services to millions of consumers, a simple defect in the production 
process can lead to thousands of lawsuits which can easily overload the judicial system. 
Although the involved firm will be surely chastised, all other firms in the country will 
be safe from prosecution (because of congestion) and start to behave as if anything was 
permitted. Alternatively, the State, in order to avoid congestion of the judicial system, 
refuses to consider but a handful of lawsuits which means that rules are poorly enforced 
and again the incentives towards firms are greatly diluted. Class-action, the grouping of 
individual demands, is a recent answer to this issue but the fact that trials can last for 



a decade works against litigation. Regulation, in those instances, is far more economical 
since it only requires the control of a small number of producer by a limited number of 
inspectors to obtain the same desired level of quality compliance. 

Litigation's second weakness is its reliance over clearly delineated property rights. 
As we argued in this chapter, many important aspects of a transaction cannot be inte- 
grated into a contract. For instance, gas emitted by thousands of sources such as cars 
or industrial plants are, as a whole, responsible for the deterioration of air quality in 
cities but it cannot be proven that a particular source caused a particular disease, hence 
litigation has no bite to reduce this costly negative externality 9 ® The problem here is 
that emissions are not associated to property rights; a regulation will improve upon lit- 
igation by forcing each industry to take preventive measures in order to reduce overall 
emissions. 

Finally, the last weakness of litigation relates to causality or the establishment of ac- 
countability. Consider a train conductor; in case of accident, it is difficult to sort bad luck 
from undue care and therefore to prove responsibility. This means that firms providing 
low level of quality would not be chastised with sufficient frequency, thus the deterrence 
effect of litigation would fail and accidents would not stop. The imposition of a minimum 
quality standard (with controls) or the mandatory display of information reduces the 
uncertainty regarding quality of the item and eliminates low quality 10 ® 

To conclude, regulation is an ex-ante rigid process enforced by specialized bureau- 
crats whereas litigation is an ex-post flexible process enforced by generalists judges. The 
latter tend to dominate the former when the involved issue can be delineated by prop- 
erty rights, when it has a deterministic causal relationship with the defendant and if it 
involves firms only (small numbers). This is why the State imposes so many quantita- 
tive regulations such as firm qualification, product certification, minimum quality stan- 
dards or occupational licenses for individuals (cf §16.2 & §9.1.2). An analytical model 
comparing the respective merits of centralized vs decentralized trading mechanisms is 
presented in §14.4.2. 

8.1.3 Coase Theorem 

We develop the original, a popular and lastly a political version of the theorem in relation 
to property rights. 

Intuition 

Coase (1960) reminds us of something generally taken for granted, namely that a clear 
delineation of property rights is a pre-condition to exchange. In the modern globalized 



and urbanized world, economic agents create countless (positive or negative) externali- 
ties in the course of their productive activities. These are a source of inefficiency because 
they are not marketed and thus have a zero price that does not match their economic 
cost or valuation. The fundamental contribution of this author is to stress that it is pre- 
cisely because the carrier of an externality is not amenable to a proper definition that 
the latter is not marketed in the first place; there are transaction cost because it is hard 
to define, specify, measure and enforce the property rights related to an externality. 

Hence, transaction costs are what ultimately prevent voluntary bargaining from at- 
taining Pareto-efficient outcomes. In order to minimize these costs, economic institutions 
such as markets (driven by prices) or command-and-control bureaucracies have devel- 
oped. The same logic is at work in Coase (1937)'s analysis of the "make or buy" dilemma 
of firms (cf §13.3.3). When transaction costs are important, some efficient exchanges 
are impeded. There are then obvious incentives to come up with innovations that re- 
duce transaction costs such as better institutions. Money, for instance, eliminates the 
transaction cost of barter which requires a double coincidence of wants. 

Popular version 

Stigler (1966), a prominent member of the Chicago School, popularizes the previous find- 
ings as the Coase Theorem in the following manner: "the initial allocation of legal enti- 
tlements does not matter from an efficiency perspective, so long as they can be exchanged 
in a perfectly competitive market or, alternatively, without suffering transaction cost". 
Critics of this exposition, such as Usher (1998) and Dixit and Olson (2000), deem this 
statement a tautology. Indeed, the tongue-in-cheek version is simply that "no big bills 
are ever left on the sidewalk", a mere restatement of Smith (1776)'s invisible hand prin- 
ciple, pointing out "the power of competitive markets to allocate resources efficiently" (cf. 
our straightforward proof in §2.4.3 on bargaining). Yet, following Machiavelli (1532), we 
may turn the theorem on its head and say that "nobody will never pass up an opportu- 
nity to deceive a fool" i.e., to gain a one-sided advantage by exploiting whatever is at his 
reach. This duality is developed in chapter 7 on rivalry. Its starting point is precisely 
the absence of perfectly defined rights for many aspects of economic activity. This reality 
motivates people or firms to appropriate old wealth rather than work hard to create new 
worth. 

In terms of policy implications, the Chicago version of the Coase theorem logically 
implies that a Pigouvian tax or a quota is likely to stumble upon transaction costs as 
much as "laisser-faire" would do. Hence, the latter option is advisable as it saves on 
implementation cost such as passing laws, creating bureaucracies and more generally 
public spending and its associated distortions (cf. §17.1.2). 



Political Coase Theorem 

The extension of the Coase Theorem to the political sphere would suggest that political 
and economic transactions create a strong tendency towards policies and institutions 
that achieve the best outcomes given the varying needs and requirements of societies, 
irrespective of which social group holds political power. Carrying this idea to its limit, 
we would come to see our current institutions as fully optimized. Furthermore, war 
would never take place as it systematically destroys wealth. Such conclusions are grossly 
incompatible with the historical record of wasteful economic policies and murderous wars 
observed across humankind. 

Acemoglu (2003) sees strong empirical and theoretical grounds for the alternative "so- 
cial conflict" theory. In essence, societies choose inefficient policies and institutions be- 
cause political elites have the (mostly violent) means to carry out their preferred choices 
which are rarely aligned with those of society as a whole (cf §16.2.4). Even though a 
policy or institutional innovation has the potential to create aggregate wealth, it is not 
undertaken because there is no way to compensate all those who would lose from the 
change. The inexistence or inability to set-up redistributive channels lead groups with 
political power to oppose efficiency enhancing proposals. Hence, transaction costs are 
at the root of inefficient institutions and policies. A similar phenomenon takes place in 
economic interactions between firms when they lack credible redistributive mechanisms. 

8.2 European Union 

We first draw an historical sketch of the political unfolding of the European Union. Then, 
we single out the relevant features for the economic analysis: the articles of law and the 
rules of the game between all parties, firms, national governments, European Commis- 
sion and Court of Justice. 

8.2.1 Historical Development 

Roots 

In 1945, after centuries of almost permanent warfare, European leaders finally under- 
stand that the path to peace and prosperity is to form an economic and politic league. The 
founding father, Jean Monnet, is a convinced federalist, specialist in economic planning 
with good connections in the US. As French planning commissioner in charge of coor- 
dinating the reconstruction effort at the end of WWII and European coordinator of the 
Marshall plan, he foresees the need to link the French and German economies to make 



future wars materially impossible. Since mining (coal, iron ore) and steel production are 
the basic inputs for these recovering economies, he proposes a pooling of resources in 
these sectors between the two countries and any other willing to join. 

1951: European Coal and Steel Community (ECSC) 

The treaty of Paris, 11@ signed by Belgium, France, (West) Germany, Italy, Luxembourg 
and the Netherlands, establishes the ECSC which is founded on a common market, com- 
mon objectives and common institutions. The stated objectives are to contribute, in 
harmony with the member states economies, to economic growth, full occupation and 
improvement of the standard of living. On the technical side, the goals are to expand 
production, organize it rationally, increase productivity, safeguard employment, lower 
prices while duly remunerating capital and finally promote international trade. 

The institutions are the High Authority, an executive supranational body, whose mis- 
sion is to implement the treaty's objectives, a Court of Justice ensuring that the law is 
observed in the interpretation and implementation of the treaty, an Assembly represent- 
ing the parliaments of the member states (with very little power) and the Council of 
member states representatives which authorizes some activities of the high authority 
and generally harmonizes it with member states economic policies. 

This treaty focuses on the commercial policy for coal and steel; it already contains 
most of the current European competition rules and gives the high authority powers to 
enforce them against firms but also against member states if necessary. It is clear for 
the drafters that the treaty establishes a "consumers' union" that will limit the natural 
tendency to output restriction of "business cartels". Apart from the aspect pertaining to 
the functioning of markets, most decision making in the treaty is based on unanimity in 
order to protect the peculiarities of each member state. 

1957: European Economic Community (EEC) 

The ECSC's fast success is not followed by political integration as attested by the failure 
of setting-up a defense community in 1954. To avoid a loss of momentum, the belgian 
foreign affairs secretary, Paul Henri Spaak, devises a further economic integration that 
culminates with the Rome treaty establishing the EEC. It basically extends the previous 
treaty to all sectors of the economy in order to build a common market; the executive body, 
renamed "Commission", loses some of its prerogatives in favor of the Council especially 
in the conflicting areas such as agriculture where the treaty remains vague. 

The updated objectives are to instate a regime of undistorted competition, remove 
duties and quotas on goods circulating within the community, eliminate the obstacles 



to the free movement of people, services and capital, coordinate national policies and 
approximate national laws to the extent required for the functioning of the common 
market. A period of twelve years is set for the achievement of these objectives together 
with binding timetables and numerical benchmarks. Regarding competition law, a 1962 
council directive gives practical directions for its implementation. 

A key novelty is the establishment of a Common Agricultural Policy (CAP) whose 
aims are to increase productivity, guarantee revenues of farmers, stabilize markets, 
guarantee security of supply and low final prices. The weight of the sector in the mem- 
ber state economies as well as the disparities of regions is underlined. The policy tools 
include price control or support, production and marketing subsidies or storage but none 
should generate discrimination among producers or buyers within the community. State 
aid is broadly authorized for inefficient farms or regions. The implementation of the 
CAP focuses on creating a common market (thereby with a unique community price), on 
giving preference to community products and on establishing financial solidarity within 
the community; it takes place only in 1962 after a lengthy process of reconciliation of 
the differing positions held by Germany and France. 12@ The CAP basically uses prices 
support and direct subsidies. It sets import tariffs so as to raise world market prices to 
a so-called EU target price and stands as an unlimited buyer to maintain community 
prices above a so-called intervention level. 

The last innovation in the treaty is a transport policy which is set-up with the broad 
aims of removing price discrimination for community trade and barriers to entry on 
national markets (i.e., same rules for everyone); it includes road, rail and waterways but 
excludes maritime and air transport. Like in other areas, the council can unanimously 
derogate from the previous competitive rules to protect a region negatively affected. 

1986: Single European Act (SEA) 

In 1965, a grave crisis over the CAP leads France to a 6 month boycott of the council, 
thereby effectively blocking EU institutions and policy. General de Gaulle refuses the 
passage into the second phase of integration scheduled in 1966 whereby every member 
state will lose his veto power in the council. France finally rips off the "Luxembourg com- 
promise" calling for unanimous decisions in cases concerning a country's vital interests. 
Although the customs unions is achieved ahead of schedule in 1968, the entry of vastly 
different countries in the community slows down the integration process regarding the 
removal of non-tariff trade restrictions such as bureaucratic regulations. 

In 1986, under the leadership of the Commission head Jacques Delors, member coun- 
tries sign the Single European Act, a rejuvenating act, looking to achieve the four free- 
doms of movement (people, goods, services and capital) by adopting qualified majority 



voting instead of unanimity for all actions necessary to implement the European Single 
Market aka internal market. The "acquis communautaire" gathering all EU law accumu- 
lated so far is mentioned as a foundation for future legal developments. The parliament 
is also associated to many decisions instead of being merely consulted while the ability 
of the council to overturn the commission's ruling is being restricted. 

1992: European Union (EU) 

The Maastricht Treaty creates the EU and introduces new forms of cooperation between 
member states, on defense, justice and home affairs. It also defines the so-called four 
freedoms of goods movements, people, services and capital and implements an archi- 
tecture based on three pillars: the existing European Community, a common policy for 
security and foreign relations and a cooperation for internal policy and justice. 

On the other hand, two principles limiting the powers of the Commission, already 
visible in the case law of the ECJ, are formally stated. According to the proportionality 
principle, the EU acts within the limits of its powers and of the objectives assigned to 
it. According to the subsidiarity principle, the EU is to act only if a common objective 
cannot be implemented in a satisfactory manner by member states acting independently 
so that, for reasons of scale or externalities, the objective is better attained by a concerted 
european action. 

The EU treaty also lays down the provisions for an Economic and Monetary Union 
(EMU) that is achieved in 1999 with the transfer of the monetary policy sovereignty of 
12 countries to the European Central Bank (ECB); national currencies disappear in 2002 
with the introduction of the Euro whose symbol is €. 

Regarding political developments, the EU Treaty strengthens the powers of the Par- 
liament, the only European institution directly elected by its citizens and creates a cit- 
izenship of the Union which permits EU citizens to vote and stand for election in local 
and European elections in the Union. 

Recent Developments 

With the Amsterdam Treaty of 1997, the Union underwent further reforms. The growth 
of cross-border problems such as crime, asylum seekers etc. led the EU countries to inte- 
grate large sections of what had previously been national prerogatives (border controls, 
immigration and asylum policy). The common external and security policy remained or- 
ganized on an intergovernmental basis, but in future the European Council was to be 
allowed to adopt common strategies, actions and positions. In addition the post of "High 
Representative for the Common External and Security Policy" was created. This was to 



give the policy a "public face". The Amsterdam Treaty entered into force on 1 May 1999. 

The enlargement of the EU to include ten new members in 2004 represents an his- 
torical step towards reunification of the continent and the promotion of peace, security, 
stability and prosperity. However, wealth differences between the old and new EU mem- 
bers remain substantial so that catching up with the average will be a process calling for 
a considerable effort and political sensitivity. 

Lastly, the 2008 Lisbon Treaty, superseding the "EU Constitution", seeks to safe- 
guard the union's ability to function and take decisions effectively by widening the use 
of super-majority voting for decision taking. 

8.2.2 Institutions of the EU 

The European Institutions are the Commission, the Council, the Court of Justice and 
the Parliament. 

Commission 

The Commission is the government of the EU, it is a supranational body holding federal 
powers, has the sole authority to take initiatives to create new Community law and mon- 
itors compliance by the member states with EU law. The commissioners are nominated 
by agreement among member states and are to act independently while member states 
are required to abstain from pressuring them. 

The commission makes hearings that help motivate its decisions are taken by simple 
majority but generally endeavours to reach a consensus; it can enact regulations which 
are compulsory orders, directives whose objectives are to be implemented by member 
states the way they see fit and opinions which are purely informative. The commission 
must report yearly in front of the assembly representing the parliaments of the member 
states. 

Councils 

The first treaty created a single council composed of one delegate by member state along 
with voting weights reflecting to some extent economic might and population. Its task 
is to To attain the objectives of the treaty, the council, upon proposal of the commission, 
enact regulations which the commission implements. The complexity of tasks has lead 
to a significant expansion. 

The European Council, composed of the Heads of State and the President of the Eu- 
ropean Commission, defines the general political objectives and directives of the EU; it 



meets at least twice a year. 

The Council of the European Union or Council of Ministers, is the effective legisla- 
tive body of the EU. Its members are the specialized ministers of the member states, 
depending on their particular portfolio. It adopts all the essential legal acts and con- 
cludes international agreements, mostly through qualified majority voting. 13@ However, 
in practice the EU endeavors to avoid majority decisions so as not to have to take deci- 
sions against the will of individual member states. In the event of disagreement, pro- 
posals are therefore generally withdrawn or negotiations continued until unanimity is 
reached. Important decisions such as the accession of a new state or the transfer of new 
powers to the EU always require unanimity. 

Court of Justice 

The Court of Justice is responsible for the interpretation and application of the Treaties 
establishing the European Communities and of the provisions laid down by the compe- 
tent Community institutions. Its members, in odd number, are appointed in a fashion 
similar to those of the commission. To expand the court's capabilities, a Court of First 
Instance is created in 1989, turning the original into an appeal court. The Court of Jus- 
tice has dealt with 13500 cases between 1951 and 2004 mostly involving environmental 
law, consumer protection, agriculture, tax, social policy and institutional law. The Court 
of First Instance has dealt with 4100 cases between 1989 and 2004 mostly involving 
competition law, State aid, trademarks and agriculture. 

Parliament 

In the first treaty, the assembly was composed of members of national parliaments and 
had quite limited power. With the widening of the union from the economic sphere to 
the political one, the parliament has seen its role increase and since 1979 it is directly 
elected by the citizens of the Union. 

In the EU legislative process, the Parliament either has a consultative opinion (the 
right to be heard) or the same decision making authority as the Council of Ministers 
(co-decision) depending on the subject concerned. Parliament adopts the annual budget 
with the Council of Ministers and controls its implementation. It is also a supervisory 
body in the sense that it confirms the appointment of the Commission and can even force 
that body to resign. 



8.2.3 Economic Principles 

The 1951 treaty sets out clearly that the community is to work as a free market economy 
without distortions (art. 2-4). However, it does not surrender fully to economic liberalism 
since it allows a series of potentially distorting instruments provided their cost-benefit 
analysis is positive for the attainment of the general community objectives. 

Pro-Competitive Principles 

Anti- competitive practices such as predatory pricing with a view to build monopoly and 
price discrimination are prohibited per se (art. 60). Offenders can be fined up to twice 
the value of illegal sales. Other illegal discriminations are that of workers based on 
citizenship (art. 69) and tariffs for transport (art. 70). 

The commission has full power to prohibit formal agreements, concerted actions tend- 
ing to distort free competition through price fixing, limitation of production, innovation 
or investment, the sharing of markets, customers or inputs sources. Offenders can be 
fined up to 10% of their annual turnover (art. 65). 

Mergers and acquisitions of firms active in the coal and steel sectors must be notified 
to the commission who can ask the parties any useful information and should take into 
account the market structure in its analysis of the case which can be favorably resolved 
by asking the parties to take the remedies it sees fit (art. 66). Small scale operations 
are automatically authorized ("de minimis" rule); 14@ otherwise clearance is given if the 
operation does not confer the merged entity the power to control prices or production or 
distribution or reduce the competitiveness of the involved markets or obtain a dominant 
position in the access to inputs or outputs. Failure to notify large mergers is fined up to 
10% of the merged assets. A last feature (oddly placed in an article on concentration) is 
the ability to intervene a firm, public or private, abusing its dominant position (in the 
sense that its behavior is contrary to the objectives of the treaty). 

The power of member states to subsidize national firms through direct aid or wages 
(aka State aid) is being limited (art. 67-68). Lastly, the commission is to give support to 
R&D whose results are to be shared with the community (art. 55). 

Distortionary Principles 

We review here articles safeguarding the interests of member states against unrestricted 
economic liberalism inside the community but also defensive articles aiming at protect- 
ing the community as a whole against foreign competition. 

In case of crisis such as a negative demand shock (art. 58) or a negative supply shock 
(art. 59), the commission can set up quotas and even temporary compensate high cost 



firms (art. 62). However indirect instruments such as minimum and maximum prices 
are preferable (art. 57, 61). Agreements can be exempted of the per-se prohibition) for 
specialization or pooling of resources if this improves production processes or if the agree- 
ment does not distort competition meaningfully (art. 65). The council can set minimum 
and maximum custom rates (art. 72). In case a foreign firm (outside community) com- 
mits dumping (excessively low price), flooding (excessively large imports) or abuses of its 
dominant position, the commission can act to protect the community (art. 74). 

Changes brought by the EC treaty 

The 1957 treaty extends the rules valid for the coal and sectors to all sectors of the 
economy. The free circulation of goods applies not only to goods produced in the commu- 
nity but also to those having lawfully entered any member state (art. 10); this the origin 
of the exhaustion principle. The elimination of custom duties (art. 12-17) and quotas 
(art. 30-37) inside the community is organized. A common customs policy with the rest 
of the world is also set-up with the aim of establishing bilateral agreements to remove 
barriers to trade (art. 18-29). Temporary safeguards are planned to attend difficulties in 
any member state. The large number of articles and paragraphs indicates the toughness 
of negotiations and thus the disparities of national situations. The agricultural policy is 
based on a list of products escaping the competition rules, a list which can be increased 
by a qualified majority vote of the council. 

8.2.4 Current Competition Law 

Origins 

As recalled by Roover (1951), monopolies and artificial restrictions were outlawed in Ro- 
man times (cf. Justinian Code c. 530). Price fixing and discrimination was likewise seen 
immoral since the price was to be set either by the State or by the interplay of demand 
and supply (cf. concept of just price). Even the granting of monopolies by kings is the sub- 
ject of criticism from intellectuals whenever they involve basic goods or services. These 
moralistic views have passed into the classic writings of Smith (1776) and from then on, 
economic theory has rationalized their relation to the concept of market efficiency. 

Goals 

The EU treaty 15@ states in its principles that member states are to adopt an economic 
policy conducted in accordance with the principle of an open market economy with free 
competition. The European Community competition law is formed by articles 81-89 and 



is enforced by the EC. Specifically, art. 81 prohibits agreements and concerted practices 
between firms (i.e., , , restraints), art. 82 prohibits the abuse of dominant position, art. 83 
calls for the setting up of directives implementing the previous goals, art. 84-85 cover the 
transitional period, art. 86 details how competition rules apply to public service providers 
while art. 87-89 covers State aid. Additionally, the merger regulation covers horizontal 
concentration that was not originally contemplated in the original 1957 treaty. 

The EU competition policy has two complementary economic goals. The liberalization 
of monopolistic sectors (art. 86-89) is a pro-competitive activity that aims at introducing 
competition where it does not currently exists and seeks to foster the integration of Eu- 
ropean markets into a single unified market, the so-called "internal market". Liberal- 
ization is treated in §16.4 while public service obligations are dealt with in §17.3.4. The 
control of State aid belong to this branch because it potentially distort the conditions of 
competition; the long term goal is to eliminate any public funding that is not warranted 
by public service obligations. 

The second goal is to maintain the conditions of "free competition" i.e., a competi- 
tion based on the merits of the contenders' offerings. This endeavor thus seeks to avoid 
anti-competitive behaviors like collusive agreements, restrictive practices and abuses of 
dominant positions. In this respect, articles 81 and 82 respectively deal with collective 
and individual abuse of dominant position. This activity called antitrust is developed in 
Part 10 (cf. §8.3 for the etymology). 

Realm of application 

A necessary condition for the application of the EU competition law is the existence of 
a direct or indirect, actual or potential Effect on Trade between member states of the 
EU. Otherwise national legislation is called for. Furthermore, the EC applies the Effects 
Doctrine stating that domestic competition laws are applicable to firms irrespective of 
their nationality, when their behavior or transactions produce an effect within the do- 
mestic territory (domestic means here either the EU or a member state). For instance, 
the merger regulation applies to companies located outside EU territory when it is fore- 
seeable that a proposed concentration will have an immediate and substantial effect in 
the EU. 

Most of the economic definitions we shall use in this book are similar to those from 
the EC's glossary or the OECD; this is a reflection of the fact that the landscape where 
firms compete in Europe is effectively shaped by the EC and European Court of Justice 
(ECJ). Three key concepts are worth emphasizing: firms, competition and market power. 

Firstly, the realm of application of the EU treaty with respect to economic activity 
is not limited to private profit-making firms; the relevant actor is the undertaking, any 



entity engaged in economic activity bearing economic risk regardless of its legal status, 
the way in which is financed or its motive. That said, we shall use the more usual 
terminology "firm". 

Secondly, competition refers to a situation where sellers of a product (or service) inde- 
pendently strive for the patronage of buyers in order to achieve their particular business 
objective. Sellers and buyers may be individuals or undertakings. Competitive rivalry 
between firms may take place in terms of price, quality, service or combinations of these 
and other factors which customers may value. 

The third key definition is that of market power; it is the ability for a firm to price 
above marginal cost and is computed with the help of a structural analysis of the market 
involving market shares, substitutability, barriers to entry , growth and the rate of inno- 
vation. Furthermore, it may involve qualitative criteria, such as the financial resources, 
the vertical integration or the product range. Market power is likely to be present in mar- 
kets with a small number of large sellers dominating a (competitive) fringe of smaller 
sellers. The former realize their interdependence in taking strategic decisions on price, 
output and quality while the latter adapt to their behavior to that of the larger players. 

Exemptions 

The EU law provides for some exemptions to the restrivitve agreements prohibited by 
art. 81 but in any case, i.e., irrespective of the size of the firms and their market shares, 
no hardcore restriction must be present. These are 

• resale price maintenance: a maximum resale price is allowed but fixed price is not. 

• market partitioning by territory or by customer: exclusive or selective distribution 
system are allowed but unsolicited orders must be served. 

• exclusive and selective distribution systems can not be combined. 

• spare parts manufacturers can sell to end users, independent repairers or service 
providers on top of the main original equipment manufacturer. 

To benefit from exemption, several criteria may apply. They require the computa- 
tion of the market share on the relevant-product and geographic-market (cf §15.3.1). 
Firstly, agreements that do not impinge meaningfully on trade between member states 
make art. 81 not applicable altogether; we find here agreements between small firms as 
defined in the de minimis notice (i.e., market share < 15%) and agency agreements where 
the principal bears all financial or investment related risk. Next, for sizeable firms (mar- 
ket share < 30%), the block exemption regulation (BER) articulates art. 81(3) which al- 
lows agreements confering sufficient benefits to outweigh their anti-competitive effects. 



The BER covers vertical agreements such as supply and distribution with a special rule 
for car distribution (cf. Table 15.18). Also covered are technology transfers, horizontal co- 
operation (R&D, specialization), insurance (standardization and information collection), 
transport and telecommunications. 

8.3 US Antitrust Laws 

The current European economic landscape has been heavily influenced by the US ex- 
perience which we now proceed to describe following its historical developments using 
the present tense given that many of the legal acts involved are still in force. A classic 
reference is Posner (2001) (cf. FTC for an online introduction). 

8.3.1 Historical Development 

Economics is deeply rooted in US culture. Indeed, the US Constitution of 1776 and the 
Bill of Rights of 1791 guarantee, among other rights, the economic rights to acquire, 
use, transfer, and dispose of property (including intellectual) and also the right to choose 
one's work. 

In the late XIX th century, the American industry becomes increasingly dominated 
by large firms which flourish in the virtually regulation-free climate of the age. Mem- 
bers of industries where access to markets is naturally restricted, such as railroads, are 
free to divide those markets between themselves and charge monopoly prices. Simulta- 
neously, firms in steel, oil and coal industries devise a new ownership structure called 
"trust" to monopolize their respective industries in a way quite similar to that employed 
by german cartels. The main leaders simply convince (or coerce) the shareholders of 
all the companies in their industry to hand over their voting rights to a newly created 
trust company in exchange for dividends. The trustee then strategically manages all 
the major competitors of that industry so as to operate as a virtual monopoly and rip 
the corresponding profits. These practices of collective price settings, eviction of new 
competitors and unrepentant use of corruption generate public outrage and enormous 
political pressure to halt the unfettered trade abuses which the trusts and monopolists 
represent for the democratic society (cf. Glaeser and Shleifer (2003)). 

The Sherman Act of 1890, quasi unanimously voted by the congress, establishes the 
US antitrust policy. In retrospective, Bork (1967) analyzes the act as an intent to advance 
consumer welfare by minimizing the restrictions to output. The Sherman Act outlaws 
all contracts, combinations and conspiracies that unreasonably restrain or monopolize 
interstate (e.g. between Texas and California) and foreign trade. This act provides crim- 



inal penalties when enforced by the government. Violation can result in substantial fines 
and, for individual transgressors, prison terms. These provisions are enforced primarily 
by the Antitrust Division of the Justice Department. In the 1990 update of the Sher- 
man Act, individual violators can be fined up to $350,000 and sentenced to up to 3 years 
in federal prison for each offense; corporations can be fined up to $10 million for each 
offense. 

In an early judgment of 1895, the US Supreme Court maintains a strict interpreta- 
tion of the Sherman Act and refuses to apply it to a cartel of sugar refiners producers 
controlling 98% of market on the grounds that the law applies to commerce and not to 
manufacture. The US president elected in 1902, Theodore Roosevelt, wants economic 
laws to balance the rights of companies and those of the citizens. Since many industries 
are dominated by a single company or a combination of companies controlled by a trust, 
he instructs the Justice Department to sue the "bad" trusts, those which fail to act in the 
public interest. In 1904, the Supreme court rules that the Sherman Act can be applied 
to holding companies. It deems an "illegal restrain to trade" the merger of Great North- 
ern and Northern Pacific railway companies into the newly founded Northern Securities 
(controlled by J.P. Morgan). In 1911, American Tobacco is declared an illegal monopoly 
and is broken up into separate companies. The same year, the Supreme court rules that 
J.D. Rockefeller's Standard Oil should be broken up into 33 companies. 16 ® 

Some ambiguities in the Sherman Act leads the US congress to complement it. 17@ 
The Clayton Act of 1914 prohibits certain specific business practices such as exclusive 
dealing arrangements, tying together multiple products, mergers or acquisitions that 
are likely to lessen competition or interlocking directorates (trusts formed by companies 
with common members on their respective boards of directors). The Federal Trade Com- 
mission Act of 1914 prohibits unfair methods of competition in interstate commerce and 
also creates the Federal Trade Commission (FTC) to enforce antitrust law by temporar- 
ily halting suspected anti-competitive practices. The Justice Department remains the 
investigator and, if necessary, the prosecutor of offending companies. The Clayton Act 
carries only civil penalties unlike the criminal penalties of the Sherman Act but it also 
permits an individual or firm who has been injured by an illegal practice to sue the per- 
petrator in the federal courts for damages 3 times as high as the loss plus attorneys' fees. 
This feature has proved to be an effective deterrent to antitrust infractions. 

The Robinson-Patman Act of 1936 amends the Clayton Act to prohibit price discrim- 
ination between purchasers of commodities of like grade and quality which are likely to 
result in substantial injury to buyers' competition or to sellers' competition. The only 
justifications are cost differences and reaction in good faith to meet the competition. 
The Celler-Kefauver Act of 1950 amends the Clayton Act on anti-merger measures. The 



Hart-Scott-Rodino Antitrust Improvements Act of 1976 makes it easier for regulators to 
investigate mergers for antitrust violations. To facilitate the task of firms who want to 
merge and need to check whether they will be allow or not, the Department of Justice 
issues the Merger Guidelines in 1968 with revisions in 1982, 1984 and 1992 (cf DoJ 
(1997)). 

For specialized industries, the Communications Act of 1934 regulates interstate and 
international communications by radio, television, wire, satellite and cable. The Federal 
Communication Commission (FCC) is the enforcer of the act. Likewise, the Federal 
Energy Regulatory Commission (FERC) established in 1977 originates in the Federal 
Power Commission established in 1920 to coordinate hydroelectric projects; the FERC 
mission is to regulate interstate commerce of energy (oil, gas and electricity). 

Regarding finances, the Federal Reserve Act of 1913 establishes the Federal Reserve 
to conduct the monetary policy, maintain the stability of the financial system and to reg- 
ulate banking institutions. The Securities Act of 1933 and the Securities Exchange Act of 
1934 establish the Securities and Exchange Commission (SEC); its duties are to secure 
the disclosure of adequate financial information for firms publicly offering securities for 
investment and to establish fair rules of the games for brokers, dealers, and exchanges 
(financial trading posts). The Commodity Futures Trading Commission (CFTC), estab- 
lished in 1974, protects market users from fraud, manipulation, and abusive practices 
related to the sale of commodity and financial futures and options (cf. also a list of 
antitrust agencies worldwide). 

8.3.2 Modern Application 

For legal scholars, antitrust laws are an integral part of the US economic system; their 
easily understandable statutory provisions embody fundamental principles that have 
been refined by a century of litigation in courts. Most antitrust laws are interpreted 
under the rule of reason i.e., the potential anticompetitive harm of a challenged practice 
is weighed against its business justification and its potential pro-competitive benefit. A 
judgment with respect to the reasonableness of the practice is made. On the other hand, 
the most blatantly anticompetitive conducts such as price fixing among competitors are 
judged per se illegal. 18@ It is enough to establish that the defendant has engaged in 
the proscribed practice; illegality follows as a matter of law, no matter how slight the 
anticompetitive effect, how small the market share of the defendants, or how proper 
their motives. 

Horizontal restraints to trade among competitors are the most serious of antitrust 
infractions because in a market economy, it is the duty of rival firms to compete with 



respect to prices, products, and services. Price fixing-an horizontal agreement affecting 
prices directly or indirectly 19@ -is one of them and is per se illegal. Other horizontal 
restraints per se illegal are the allocation of markets or customers and the concerted 
refusal to deal. 

Vertical arrangements and restraints involve relationships among suppliers and cus- 
tomers on different distributional levels; as they do not implicate direct competitors, 
courts are more more lenient towards them. The previous horizontal and vertical re- 
straints are antitrust offenses of a behavioral nature as opposed to those affecting the 
market in a structural way. 

8.3.3 Comparison of the EU & US 

Legal scholars give credit to the legal systems and practices of France, Germany and 
Italy in the drafting of the first European treaties (Paris and Rome), only acknowledging 
a background US influence. However, when we look at the body of law that matters 
for the study of economic conduct, one cannot avoid to see the resemblance with the US 
institutions described in the next section. 

While the first European Treaty (1951) was limited in scope to a single sector of 
the economy, it was audacious in terms of political integration with the enactment of 
two supranational bodies, the (executive) High Authority and the Court of Justice. This 
is an unusual feature for countries whose mode of relation (apart from war) has tra- 
ditionally been cooperation based on consensus i.e., unanimous agreement giving veto 
power to everyone. The root of this boldness is the understanding that the integration 
of economies damaged by war and before it, by the great depression, has to overcome 
the anti-competitive conduct of national monopolies in the basic input industries. The 
treaty's designers understood that a strong and effective "US-like" competition policy 
would be necessary for that task. This is why the treaty gave the High Authority broad 
powers very much like a US federal commission, so much so, it was later renamed the 
Commission. Likewise, one cannot miss the similarity of the Court of Justice with the 
US Supreme Court as the ultimate interpreter of the law (European Treaty or US Act). 

The 1957 EC Treaty's great achievement is to extend the successful coal & steel for- 
mula to most areas of the economy by putting them under the existing federal umbrella. 
At that point, the concept of European Commission is far more ambitious than a US 
federal commission whose scope is delimited by the Act creating it. The downside is 
obviously the bureaucratic challenge of carrying so many different activities under the 
same hood. 

In the US, the supreme court has often been called to either confirm or infirm the 



government and the federal commissions. Similarly, but less intensively, the European 
Court of Justice has produced a body of case-law upon which the Commission has up- 
dated its directives (orders in US parlance). 



Chapter 9 
Anti-Competitive Practices 



The ideal of perfect competition assumes independent suppliers, each of whom is subject 
to the competitive pressure exerted by rivals. It is therefore a priority task of competition 
law to preserve this feature and prohibit agreements or practices which might reduce 
such competitive pressure. 

We distinguish horizontal from vertical anti-competitive agreements. The first class 
regards the collective abuse of dominant position aka collusion and cartelization. Such 
a mutual understanding aims at coordinating the policies of the members to increase 
their overall market power. As we show in the first section, collusive agreements reduce 
market efficiency and are therefore rightly qualified as anti-competitive. We study their 
inherent instability and also review some historic evidence. 

The second class of anti-competitive agreements regards the relationship with the 
clients or suppliers. We deal with three categories. Best-price clauses can potentially 
limit competition and help enforce collusion by removing the ability to steal customers 
from a competitor. Vertical restraints are price or non-price limitations on distributors 
with a view to limit downstream competition. Lastly, the legal aspects of differential 
pricing are mentioned. We conclude with a few figures relative to antitrust activity. 

9.1 Cartel and Collusion 

A cartel or trust in US parlance 1@ is a public agreement among a group of firms while 
collusion refers to the secret version of the same agreement. Smith (1776)'s description 
of the subject is still topical: People of the same trade seldom meet together, even for 
merriment and diversion, but the conversation ends in a conspiracy against the public, or 
in some contrivance to raise prices. It is impossible indeed to prevent such meetings, by 
any law which either could be executed, or would be consistent with liberty and justice. 
But though the law cannot hinder people of the same trade from sometimes assembling 
together, it ought to do nothing to facilitate such assemblies; much less to render them 



necessary (§1.10.82). 

9.1.1 Inefficiency and Instability of Price Fixing 

Inefficiency 

We study the effect upon efficiency of an agreement among competing firms to coordinate 
their pricing policies, independently of whether this happens through collusion or the 
establishment of a cartel. When firms enter into an agreement, they substitute their 
individual freedom for a collective constraint; it might at first sight appear prejudicial 
for them but this commitment generates additional market power and thereby greater 
profits, which can be later shared. 

There is firstly a technical advantage to cooperate because the production technolo- 
gies can be exploited to reduce overall costs. Given a joint plan of production, if the 
marginal cost among members A and B of the cartel are not equal then a saving can be 
realized by switching some production from the dear to the the cheap plant. This makes 
the cartel a stronger competitor than the sum of the original member's strengths. 

The second benefit of cooperation is a strategic advantage for market competition. 
Coordination amounts to reduce the number of competitors which, ceteris paribus, in- 
creases industry profits; there is thus a possibility for the cartel to increase his absolute 
share of profits. As explained with more details in §15.2 on the merger paradox, if firms 
compete on quantities, the coalition of firms entering the agreement must be very large 
to be attractive to all their prospective members. The most frequent form of cartel in- 
deed gathers all major industry members except may be the smallest and youngest ones 
so that the adequate model is that of a monopoly (the cartel) facing a competitive fringe 
of price-taker firms. 

What is then the optimal behavior of a cartel formed by n firms? Their objective is to 
maximize the sum of profits 

n = E 7i t = E qiP(Q) - Ciiqd = QP(Q) - E Qiqi) (9.1) 

i<n i<n i<n 

where P is the inverse of the residual demand received by the cartel and Q = T.i< n qi- 
When the cartel gathers all the industry, this is the market WTP. 

The first step to maximize n is to find out the cartel's cost function i.e., given a total 
output objective Q, find the cost minimizing distribution {qi)i< n among members. The 
solution is the multi-plant cost function C{Q) (cf eq. 2.8) solving C mi i{q\) = ... = C min {q n ). 2@ 
Now, (9.1) reduces to maximizes n = QP{Q)-C{Q) which is a standard monopoly problem. 
As will be made clear in the next paragraph, each member is called to produce less than 



the "no-cartel" situation, which means that the cartel formation leads unambiguously to 
an aggregate output reduction. 

It is now easy to assess the welfare consequences of a successful cartel formation. 

Whatever might have been the intensity of competition prior to the agreement, the 
appearance of a cartel reduces competition, thus aggregate production which means 
that an inefficient change of market structure occured; the outcome is even worse in 
terms of consumer surplus. 

Instability 

As originally observed by Cournot (1838), anti-competitive agreements are naturally 
difficult to sustain because each member has an incentive to cheat or free ride on his 
partners by flooding the market. Recall indeed that the rationale behind the optimal 
monopoly behavior is the understanding that additional sales can only come at the ex- 
pense of an additional rebate to existing clients. This logic is less stringent for a member 
of the cartel because the rebate would be only for his own clients so that his personal in- 
centive to increase sales is stronger than that of the cartel (considered as a single large 
firm). To draw further implications we use the following equation: 

^m = {->m = Rm ^ Rm w-^/ 

where " denote cartel variables. 

The cartel's optimum is chosen so as to satisfy the second equality in (9.2); further- 
more, technical efficiency requires the first equality in (9.2) i.e., the production of individ- 
ual members generate the same marginal cost. Now, our discovery that a cartel member 
faces a greater marginal revenue enables to conclude that it is worthwhile for each mem- 
ber to increase production, since his marginal revenue is greater than his marginal cost 
when producing his share of the cartel's optimum. We have just uncovered the inner 
instability of cartels which is frequently illustrated by the frequent breaches of OPEC 
quotas agreements as shown in the next section. 

To see this result formally, assume that independent firms compete in quantities 
(Cournot) as it is the case for petroleum. The cartel revenue is R{Q) = Q x P{Q) where Q is 
the cartel's production while a member's revenue is R{q) = qxP(Q) = qxP(q+Q) where q is 
his share of the total production and Q = Q - q is the aggregate production of the remain- 
ing members. If we now look at marginal revenues, we obtain R m {Q) = P{Q) + QP'iQ) and 
the greater R m {q) - P{Q) + qP'iQ) (since Q > q and P' < 0). The member's marginal profit, 
when producing his share q of the cartel's optimum Q is thus R m [q) -C m > R m {Q) -C m = 



i.e., he ought to increase production beyond what the cartel recommends, more precisely 
he ought to produce the Cournot best reply to Q. 

In the example where C{q) = cq 2 /2 and D{p) = a- bp we obtain C{Q) = ^Q 2 and the 
collusive quantity solving R m {Q) = C m (Q) is Q = 2 ™ bc (this is the monopoly quantity q M 
where the cost coefficient c is changed into cln). The individual quantity is q = 2n ^ bc and 
the optimal deviation is q = a ~ ( "~^ — = "2+^ ^ which is greater than q as soon as n > 2. 

Price fixing agreements, either through a cartel or collusion, reduce competitiveness 
and the efficiency of the market. They are inherently unstable because each partici- 
pant has an incentive to "double-cross" his partners. 

It is clear that to maintain the stability of a cartel or a collusive agreement, each 
member must face the threat of punishment in case he does not fulfill his role. Firms 
use price wars to retaliate when a member deviates from the agreement. In this re- 
spect, mafias are successful cartels because they possess a very powerful enforcement 
mechanism, the death penalty. 

9.1.2 History of Cartels 

Origin 

A cartel (from the german word Kartel) is a formal agreement designed to limit or elimi- 
nate competition between participants, with the objective of increasing profits. Members 
can decide to fix prices, limit their output, share markets, allocate customers, adjudge 
territories or even manipulate auctions for public work procurement (cf §22.1.2). 

Historically, cartels developed in Germany during the 19 th century as associations of 
firms acting in the same industrial sectors; there were more than 500 at the beginning 
of the 20 th century. Competition on world markets quickly lead to the creation of in- 
ternational cartels; more than 150 were active between the two world wars. Nussbaum 
(1986) estimates that international cartels controlled approximately 40% of world trade 
between 1929 and 1937. More recently, the Japanese and Korean industries have used 
similar associations, respectively called zaibatsu and chaebol (literally "money clan"). 
Yet experience and theory have shown that in the long run the distortion on competi- 
tion is too strong and arms society (final consumers and firms which are clients of the 
cartels). For that reason, almost all countries prohibit cartels. 

Figure 9.1 presents a sample of cartels whose demise came from entry, cheating, tech- 
nological change and antitrust activity (cf. Levenstein and Suslow (2006)). 



Industry 


Year 


N 


Years 


Parcel Post 


1851 


5 


29 


Diamonds 


1870 


? 


60 


Ocean shipping 


1870 


2-8 


51 


Oil 


1871 


19-50 


2 


Railroad-Oil 


1871 


3-4 


7 


Railroad 


1875 


3-15 


4 


Potash 


1877 


3-30 


9 


Bromine 


1885 


7-15 


6 


Sugar 


1887 


8-19 


7 


Cement 


1922 


4 


40 


Steel 


1926 


4-8 


7 


Mercury 


1928 


2 


25 


Tea 


1929 


349 


4 


Rayon 


1932 


2 


8 


Beer 


1933 


500 


9 


Electrical Eq. 


1950 


40 


8 



Table 9.1: Examples of Cartels 



OPEC 



The most famous example of a legal cartel is the OPEC association of countries produc- 
ing large quantities of petroleum and natural gas. Each involved government owns a 
public monopoly for the production of fossil energy but once they go on the world market 
to sell their production, they are supposed to compete. The cartel was formed in 1960 
after the US government imposed a quota adverse to the interest of oil exporters not 
belonging to the american continent. OPEC's has currently 11 members and meet ap- 
proximatively three times a year to revise the ceiling allocations for production of crude 
oil. These ceiling are voluntary export restraints agreed by all but each member is never- 
theless entirely free to overshoot them. At the start of the 1970s, OPEC's share of world 
production was in excess of one half; it is quite clear on Figure 9.1, computed from OPEC 
(2004), that the strategy of setting binding ceiling allocations in order to reduce world 
supply was effective to raise the price during a decade. The unexpected consequence the 
recession it created together with a depressed oil demand and a severe fall of the market 
price. 

In Table 9.2, computed from OPEC (2004), the bottom row displays the ratio of to- 
tal OPEC production over its own total ceiling allocations; 3 ® overshooting occurs when 
demand is strong and conversely, the quotas fail to bind in periods of low prices and de- 
pressed demand. The country rows display a measure of quota respect by each member; 
less (resp. more) than 100 means that the member behaves better (resp. worse) than the 
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Figure 9.1: OPEC output policy 

average. 4@ Notice that Saudi Arabia is systematically below par (except during the first 
gulf war) because it is the swing member in charge of smoothing the overall production. 
Using the HHI index of concentration developed in §15.3, we may state that oil re- 
serves are concentrated, with HHI of 5963 in 2003 if we consider OPEC as an effective 
cartel, but only 1022 if each country is taken separately. Oil production is rather less 
concentrated, with an HHI of 1900 if OPEC countries are joined, or 571 taking each coun- 
try individually. Oil consumption is slightly more concentrated than production if we 
consider each country individually, with an HHI of 876. 
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Figure 9.2: Quota overshooting by OPEC members 



Another international cartel is the International Air Transport Association (IATA) 
which used to coordinate fares of international air travel. Sectorial deregulation in the 
US and Europe and the new Open Skies agreement has severely cut back its ability to 
maintain high prices especially within integrated economic areas as shown with more 
detail in §17.3.3. 



Guilds and Licensed Occupations 

Since medieval times, guilds of craftsmen and merchants have been examples of pri- 
vate but regulated monopolies who held special privileges (cf Smith (1776), 1.10.72). 
Some were organized as cartels while others were monopoly bestowed by the State (or 
the King). Medieval Kings gave charters to towns allowing them to hold fairs (and tax 
entrance) or to be the sole producer of a commodity (within the country). 

Nowadays, most professions where learning and training is certified by a diploma are 
organized as "associations" and are lightly regulated by the State (cf. §16.2). 5@ prices 
may be negotiated with the government or may be set freely by the guild but their most 
distinctive feature is to be more or less identical in the entire profession. Fundamental to 
the well being of the association is the ability to limit entry in order to sustain monopoly 
prices. Examples include doctors, architects, engineers, accountants or lawyers. 

Their weight of licensed occupations in the economy has grown from 4.5% of the US 
workforce in 1950 to 20% in 2000. This is a large increase but not so much if one takes 
into account the transformation of the economy into services where many hold higher 
education diploma (college enrollment increased seven fold in the same period or pass 
from 1.4% to 5.4% of the population). Law and Kim (2005)'s econometric studies concludes 
that the rise of regulation for many new activities during the progressive era in the US 
was an organized response to asymmetric information problems. As growth in scientific 
knowledge was accompanied by specialization, individuals became less knowledgeable 
about the goods and services they purchased in the marketplace. Moreover, as society 
became urbanized, market exchanges became more anonymous so that traditional mech- 
anisms for overcoming the asymmetric information problem were less useful. However, 
the view we purport in this book is that each of these clique captured its regulator and 
has maintained a too tight supply of services over the last 50 years. 

As far as economics are concerned, unions are also guilds. Hollywood hosts two fa- 
mous guilds, that of actors and writers. The associations representing artists (e.g., AS- 
CAP or SGAE or SACEM) and managing the royalties over the reproduction of their 
works are also legal monopolies whose statute is increasingly questioned as being anti- 
competitive and stifling entry and innovation in these industries. Trade associations use 
public announcements to coordinate their actions; this can be a list of prices, a delegation 
of power to their union representative, production quotas, agreements on product stan- 
dards or limits to entry in the sector (numerus clausus). A very important instrument 
to avoid the price-cutting behavior alluded before is to establish market shares which 
are most homogeneous in order that each member be a true monopoly facing a partial 
demand as elastic as the total one; this way the incentive to price at monopoly level 
be preserved. This kind of market segmentation by geographic origin or sector (service 



vs. industry) is practicable only if there are no uncorrelated demand fluctuations among 
segments that would force to revise too often the quotas or segments definitions. 
To summarize, a cartel is inherently fragile for the following reasons: 

Enforcement: the coordinated behavior is not self-enforcing because of the free rid- 
ing incentives, thus it must be enforced by the threat of punishment in case a 
member does not fulfill his role. 

Sharing: a successful cartel derives some surplus or quasi-rent; we can expect some 
quibbling over its division among members. 

Entry: a successful cartel, by generating above average returns, induces the entry of 
new competitors in the market which threatens the cartel's stability. 

Retaliation: the other side of the market suffering from a successful cartel will un- 
doubtedly try to retaliate. 



9.1.3 Collusion 

Whenever cartels are prohibited, firms willing to form a cartel can instead collude i.e., 
adopt the same behavior aimed at raising prices and restricting output but secretly to 
avoid prosecution by antitrust authorities. 6 ® We view collusion as a broader concept 
than cartel because, although firms might be forced to act individually and are deterred 
from explicitly coordination by law, they might nevertheless recognize their interdepen- 
dence in the market competition and try to behave in ways that signals to others their 
willingness to tacitly collude in order to jointly exercise market power. 

Sustaining collusion is obviously more difficult than sustaining a cartel since every- 
thing has to be kept secret, most notably the enforcement and sharing issues alluded to 
before. For instance, the EC fined Japanese video games maker Nintendo and seven of 
its official EU distributors for having agreed between 1991 and 1998 to maintain price 
differences across the EU. Each distributor was to prevent so-called parallel trade i.e., 
exports from its territory to another country by investigating the use of shipments by 
its clients. Exporters were punished by smaller shipments or boycott. As we show in 
the next paragraph, best-price clauses are strategic devices that facilitate collusion in 
a dynamic setting; antitrust autorities refer to them as facilitating practices. Detecting 
secret price cuts is the primary issue to enforce a collusion. One case where collusion is 
aided rather than deterred is public procurement since it is frequent that all bids, not 
only the winning one, are made public thereby enabling the identification of a cheater. 

McAfee and McMillan (1992) argue that sharing is so controversial among collud- 



ers that it often leads a discontented member to denounce the illegal agreement to au- 
thorities. This is illustrated in a number of decisions by the EC relative to cartels and 
collusion (cf update): 

• In response to increased competition, the auction houses Christie's and Sotheby's col- 

luded to fix commission fees between 1993 and 2000. In 2002, the EC granted 
immunity to Christie's for coming forward with evidence of collusion while it fined 
Sotheby's 20 m€ , (6% of its worldwide turnover), including a 40% reduction for its 
cooperation in the investigation. 

• 478 m€ fines to a cartel of plasterboard producers from Benelux, Germany, France 

and the UK between 1992 and 1998. The incriminated firms had agreed to restrict 
competition on these markets in line with their interests, exchanged information 
on their sales volumes and informed one another of price increases. 

• 124 m€ fines to eight Austrian banks for their participation in a wide-ranging price 

cartel. 

• 125 m€ fines to one European and one Japanese firm producing food for animals; a 

third company was granted immunity from fines because it revealed the cartel's 
existence and provided the EC with decisive evidence. 

• 25 m€ fines to seven Dutch companies for fixing prices of industrial and medical gas 

in the Netherlands. 

• 20 m€ fines to one Japanese and two Korean firms producers of food flavour enhancers 

for price-fixing and customer allocation between 1988 and 1998. A fourth Japanese 
firm was granted immunity from fines because it revealed the cartel's existence and 
provided the EC with decisive evidence. 

Famous price-fixing issues in the US are 

• 1982: IBM won a decade-long battle when charges of illegal monopoly against it were 

dropped. 

• 1983: AT&T was broken into a single long-distance service and the seven "baby Bells" 

local telephone companies. 

• 1998: Eastman Chemical paid an $11 million fine for participating in an international 

price-fixing conspiracy in the food preservatives industry. 

• 1999: the vitamins cartel involving U.S., Swiss, German, Canadian and Japanese firms 

was fined $850 million; a number of top executives went to jail. 

Table 9.2 shows that recent actions against price fixing deal mostly with intermediate 
goods and services. 



Industry 


Years 


Aluminum Phosphide 


1 


Bromine Products 


3 


Cable-Stayed Bridges 


1 


Carbon Cathode Block 


2 


Cartonboard 


5 


Cement 


11 


Citric Acid 


4 


Ferrosilicon Products 


2 


Ferry Operators (Adriatic Sea) 


7 


Ferry Operators (Channel) 


1 


Fine Arts 


6 


Graphite Electrodes 


5 


Isostatic Graphite 


5 


Laminated Plastic Tubes 


9 


Lysine 


3 


Maltol 


6 


Marine Construction Services 


4 


Marine TransportationServices 


5 



Industry 


Years 


Plastic Dinnerware 


1 


Shipping (Central WestAfrican) 


20 


Shipping (Far Eastern) 


4 


Shipping (France -Africa) 


17 


Shipping (NorthAtlantic) 


2 


Sodium Erythorbate 


2 


Sodium Gluconate 


2 


Sorbates 


17 


Stainless Steel 


1 


Steel Beam 


6 


Steel Heating Pipes 


4 


Steel Tube 


5 


Sugar 


4 


Tampico Fiber 


5 


Thermal Fax Paper 


1 


Vitamins 


9 


Wastewater Construction 


8 



Table 9.2: International Price-Fixing US DOJ and EC Actions 

9.2 Vertical Agreements 



9.2.1 Best Price Clauses 



In 1979, the US Federal Trade Commission (FTC) sued four manufacturers of chemical 
additives for gasoline because they had adopted marketing practices conducive of collu- 
sion, the so-called Ethyl case of Post-sale restraint. The defendants reportedly signed 
"most favored nation" agreements, gave 30-day notices of price increases before they be- 
came effective and applied uniform delivered pricing. The price notices usually resulted 
in price increases by all four companies on the same day. From the FTC's point of view, 
coordination among the firms, usually in advance of impending price changes, eliminated 
uncertainty about each others' willingness to follow an upward price lead. Price signal- 
ing through the press was a coordination instrument to maintain uniform pricing and 
reduce competition sharply in an already oligopolistic industry. Although no conspir- 
acy was involved, the FTC decided in 1983 to bar the defending firms from announcing 
price changes before their effective date, using price formulas that systematically match 



a competitor's price, providing sales information to any of its competitors except in con- 
nection with a legitimate sale between them and using "most favored nation" clauses in 
contracts. The US Court of Appeals reversed the decision the next year. 

In our formal analysis, we shall distinguish two clauses, one static and one dynamic. 
A meet or release (MOR) clause sets the current price at the minimum of competitors 
current prices or releases the client from his buying obligation (if the transaction takes 
place before the good or service is consumed). A most favored customer (MFC) clause 
guarantees actual customers they will benefit from a future price reduction offered to 
other customers (of the same firm). 7@ 

The usefulness of the MFC clause for a firm is to reduce the temptation to lower 
prices in order to attract new customers. We saw in §4.3.5 that it could be used by a 
durable good monopoly to mitigate inter-temporal discrimination. Let us show how it 
turns price competition into the more gentle quantity competition when the items for 
sale are durable goods such as domestic appliances (TV, fridge) or durable services such 
as travel tickets. We first observe that, in this context, a sales contract is binding because 
there is no possibility to return the item to the producer or resale it at full price. 8@ 

Consider firms competing on price in a market for an homogeneous good and imagine 
there is a common initial Bertrand equilibrium price p; it generates total sales Q = Dip) 
and individual sales qj for all firms j = l,..,n. Imagine now that firm i sells her product 
with a MFC clause. Whenever firm i offers a discount price p\ upon the current p, she 
gains the x = D{pi) -D{p) new customers attracted by the price decline but cannot steal 
any from her competitors since previous sales are binding. At the same time, the MFC 
forces her to offer the same rebate to all her former clients so that all units are sold at 
price pi < p, her profit is thus 

Tti = pi [c\i + x)- Ci {c\i + x) = {c\i + x)P (0 + x] - Ci {q t + x) 
= qiP[Q-i + qi)-Ci(qi) 

for any q t > q t . The incentive to deviate from the current situation is given by -^ com- 
puted at qf, hence it is only if the current situation is the Cournot equilibrium that this 
incentive is nil for all firms. More precisely, if the current price p is larger than the Cour- 
not price, then at least one firm has an incentive to offer a discount while if the current 
price is lesser than the Cournot one and firms share evenly the market, no one has an 
incentive to offer a discount. 

If we now consider perishable goods like fruits and vegetables then sales contracts 
are generally not binding i.e., if a better price is available at delivery time, the buyer 
can renege his engagement to buy and take advantage of the better opportunity. In that 
case, the previous result holds only if an additional MOR clause protects the firm victim 



of a price-cut from losing any of her clients (they are automatically compensated, thus 
remain faithful). 

Holt and Scheffman (1987) show that MFC and MOR clauses permit to sustain supra- 
competitive prices in a market for an homogeneous perishable good. Firms publish prices 
in advance of the actual trade period, consumers address their prospective demand to 
the minimum of these listed prices; lastly, firms can offer discounts at the trading time. 
In this framework, any price between the competitive one and the Cournot one is an 
equilibrium that will not be under-cut by a discount at trading time. As we saw above, 
agreeing on a price higher than the Cournot one is not an equilibrium since it will surely 
provoke a later discount. On the contrary, any lesser price is immune to an upward 
deviation because consumers stick to the lower listed price and to a downward deviation 
since the incentive identified above tell us that this move reduces the deviant's profit. 
In case the good is durable, the same supra-competitive outcome holds using an MFC 
clause only. 

Schnitzer (1994) goes further to disentangle the respective role of the two clauses 
and gives a theoretical foundation for the widely help suspicion that MOR clauses are 
retaliation devices used to enforce collusive agreement. She consider a market for homo- 
geneous durable goods with new consumers appearing at each period. 9 ® In this dynamic 
setting, the MOR clause makes the detection of cheating on a price-fixing agreement 
very effective since cartel members get informed by the consumers themselves. Then, it 
is possible to punish the deviant by offering a (retaliation) rebate to force him to repay 
this amount to all of his customers, and make a huge loss. It is then possible to sustain 
the monopoly price in all periods except the final ones. Indeed, deviating upward is sim- 
ply dominated while deviating downward brings a one-shot benefit that is annihilated 
by the later retaliation using the MOR clause. 10 ® 

9.2.2 Vertical Restraints 

Vertical restraints (or obligations) are contractual clauses designed by upstream firms, 
most often manufacturers of branded goods like "Adidas" or "Gucci", for their retailers. 
Some restraints have to do with prices and quantities while other deal with differentia- 
tion (cf ch.ll) like the granting of an exclusive territory (horizontal differentiation) or 
the specification of a quality of service to be provided to clients (vertical differentiation). 
When manufacturers and retailers maintain arm's length relationship using incom- 
plete contracts (especially linear pricing), the latter have some freedom to apply their 
market power. Cournot (1838) shows that a "double marginalization" appears prevent- 
ing the industry from fully exerting its market power. As we show in §14.1.4 on double 



marginalization, integration is a way to restore profits but it also comes with costs. Ver- 
tical restraints therefore appear as a lighter alternative. 

The resale price maintenance (RPM) stipulates limits to the price that the down- 
stream firm can charge to consumers. 11 ® 1 This practice is illegal per-se in most major 
developed countries although the theoretical literature has never reached a clear-cut 
conclusion about its efficiency property (harmful vs. beneficial). A reflection of this fact 
is that the EC classifies it as an hard-core restriction (cannot benefit from a block ex- 
emption) but not as an illegal practice per-se. Similarly a 1997 judgment from the US 
supreme court has deemed legal a maximum price in the gasoline retail market. The 
first case in Europe occured in 1964 when the EC declared illegal the granting of ex- 
clusive dealership rights by Grundig, a German manufacturer of household appliances 
to its French subsidiary. The ruling was uphelp by the ECJ in 1966 and expanded the 
definition of measures affecting trade to include "potential effects". 

The strongest arguments against RPMs is that they help enforce collusion at all lev- 
els. Whenever a producer uses the same minimum RPM for all his retailers in a given 
geographic area or market, the retailers become a de-facto cartel since they cannot any- 
more lower the retail price of the product; intra-brand collusion is taking place. For 
collusion among manufacturers (inter-brand), the argument builds on the following ob- 
servation: under a RPM, retailers cannot adjust their retail prices to local conjuncture 
(e.g., macro-economic shocks affecting retail costs or demand). Thus, if a group of collu- 
sive producers use RPMs, retail prices will be quite stable. If now one producer breaks 
the collusive agreement by offering a rebate to his retailers, those will want to pass the 
rebate to their customer in order to boost sales. Hence, the retail price will fall and 
since it is a public information, the cheating manufacturer will be easily identified (and 
punished by fellow conspirators). 

Exclusive territories (e.g., one official shop for the brand per city) is another mean to 
avoid destructive intra-brand downstream competition. In the absence of such a restric- 
tion, retailers would compete harshly, sell at marginal cost and make zero profits. In 
that case, the manufacturer would not be able to use franchise fees to aspire consumer 
surplus from the retailers. If the manufacturer reverts to linear prices then the dou- 
ble marginalization problem reappears. A local monopoly guaranteed by a contractual 
agreement is the easiest way to restore the retailer's margin that can later be shared 
with the manufacturer through the franchise fee. The vertical restrictions are generally 
tolerated on the ground that they permit fair margins for retailers that enable them to 
provide (finance) a service to consumers that is commensurate with the brand image e.g., 
a test stand for perfume with an attendant. 

A restraint aimed at softening inter-brand competition is the exclusivity deal between 



a producer and a retailer whereby the later agrees not to sell competing brand in his 
shop. This completely makes sense for upstream firms like "Adidas" or "Gucci" to avoid 
having their products along those of "Nike" or "Dior", respectively, on the same shelve. 
We study in §10.2.1 how the anti-competitive potential of these exclusive deals with 
respect to the entry of challengers. 

9.2.3 Price Discrimination 

As we already explained in §4, price discrimination for freight transportation developed 
hand in hand with the railway system. In Britain and the US, this practice caused 
unease in many sectors of the economy. The US congress had to pass the Interstate 
Commerce Act in 1887 to regulate this activity, imposing fair and transparent rates, 
prohibiting personal or distance related discrimination. The Robinson-Patman act of 
1936 later prohibited any price discrimination lessening competition. The landmark 
Morton Salt ruling of the US supreme court in 1948 made clear that different prices could 
only be based on different costs, otherwise charging lower prices to large supermarkets 
would injure groceries (and lessen food retail competition). 

These limitations have since then lost their bite and in Europe, no mention of it is 
made in any of the articles relating to economic laws. 

It may therefore be said that price discrimination is a perfectly legal business behavior 
inso far as it does not build directly on characteristics such as sex, race or religion that 
are explicitly protected by laws of higher order. 

9.2.4 Antitrust Activity 

Europe 

Merger scrutinity in Europe is reviewed in §15.1.3. The EC reports that national com- 
petition authorities analyzed some 180 antitrust cases in 2005 (cf. EC (2005)), evenly 
distributed between collusion (art. 81) and abuse of dominant position (art. 82). With re- 
spect to the Commission itself, Table 9.3 gathers some recent trends. 

The fines imposed to cartels by the EC (cf. guidelines)and corrected for court judg- 
ments are reported in Table 9.4. The ten highest cartel fines since 2000 regard eleva- 
tors and escalators, vitamins, gas insulated switchgear, synthetic rubber, plasterboard, 
hydrogen peroxide and perborate, methacrylates, fittings, carbonless paper and lasltly, 
industrial bags. It is quite clear from this list that cartelization occurs mostly in in- 
termediate goods industries. The fight against cartels rests upon the leniency program 



Year 


2000 


2001 


2002 


2003 


2004 


2005 


Opening 














Complaints 


112 


116 


129 


94 


85 


55 


EC Initiative 


84 


74 


91 


97 


52 


39 


Other 


101 


94 


101 


71 


21 


11 


Total 


297 


284 


321 


262 


158 


105 


Closing 














Informally 


362 


324 


330 


295 


363 


207 


Formally 


38 


54 


33 


24 


28 


37 


Total 


400 


378 


363 


319 


391 


244 



Table 9.3: Dominant Position 

giving immunity from fines for the first firm that provides evidence of a cartel to the EC, 
and a substantial reduction in fines for any subsequent applicant. Under the 1996 first 
leniency programme, the EC received an average 13 applications per year. With the 2002 
revision of the programme, the yearly average surged to 41. In 2005, the Commission 
received 17 applications for immunity of which 6 were granted and 11 applications for a 
reduction of fines. 



Year 


2002 


2003 


2004 


2005 


2006 


Total fines 


905 


401 


390 


683 


1846 


Decisions 


9 


5 


6 


5 


7 



Table 9.4: Cartel Fines 



State Aid in Europe 

Before commenting on the EC activity relative to State aid, we provide some basic figures 
from Eurostat regarding the amounts involved. In 2005, the member states at the EU25 
level granted 64b€ of State aid and 40b€ of railways subsidies (which are not accounted 
as State aid because it is considered a public service); this transfer amounts to 1% of 
Europe's GDP. The financing of State aid uses equally grants and tax exemptions (plus 
guarantees and equity participations). Tables 9.5-9.4 gather some geographical and his- 
torical figures for the smaller EU15 set of countries where State aid has been halved in 
GDP terms since 1990. 



Country 


DE 


FR 


IT 


UK 


ES 


SW 


NL 


BE 


State Aid 


20.3 


9.7 


6.4 


4.5 


3.8 


3.1 


2.0 


1.2 


Railways 


8 


10 


6 


6.5 


1.3 


1.2 


2.4 


2.0 



Table 9.5: Public Aid in 2005 (b€) 



Year 


95 


96 


97 


98 


99 


00 


01 


02 


03 


04 


05 


State Aid 


78 


77 


97 


65 


56 


60 


61 


68 


57 


59 


59 


Railways 












33 


42 


41 


39 


39 


39 



Table 9.6: Evolution of EU15 Public Aid (b€) 

There are significant differences between member states regarding the sectors to 
which they direct aid, however at the EU-25 level, 17b€ went to agriculture and fish- 
eries, 4b€ to coal, 2b€ to transport (mostly maritime), 2b€ to manufacture and 2b€ to 
services. The remaining 38b€ represent horizontal objectives, i.e. not granted to spe- 
cific sectors. This form of aid is considered better suited to address market failures and 
thus less distortionary than sectoral and ad hoc aid. Employment received 3.6b€, re- 
gional development 8.6b€, R&D 5.4b€, SMEs 4.5b€ while the remaining 12.6b€ went to 
environment and energy saving. 

The trend in State aid cases investigated by the EC is reported in Table 9.7. A com- 
parison with Table 9.3 shows a different scale of complaints, all the more that complaints 
received in agriculture, fisheries, transport and coal are excluded from available statis- 
tics. This feature is a direct consequence of the volume of State aid. As far as member 
states are concerned, Italy, Germany, France, Poland, Spain and the UK provide the bulk 
of cases. 



Year 


2000 


2001 


2002 


2003 


2004 


2005 


Notified 


883 


892 


822 


657 


661 


690 


Un-notified 


134 


152 


154 


101 


84 


92 


Complaints 


94 


152 


192 


177 


221 


204 


Total 


1111 


1196 


1168 


935 


966 


986 



Table 9.7: State Aid Scrutiny in Europe 



US Antitrust Activity 

It is often said that antitrust in the US by the FTC and the department of justice (DOJ) 
was too active in the 1960s and 1970s, too passive in the 1980s and adequate in the 
1990s, reflecting the changes in power between democrats and republicans. Academics 
rather argue for a paradigm shift with the absorption of Chicago School perspectives into 
the mainstream of antitrust policy in the 1970s and 1980s as illustrated by the data of 
Table 9.8 (cf. Crandall and Winston (2003)). 



Period 


Hor. restraints 


Ver. restraints 


Dominance 


Discrimination 


1961-1968 


2.6 


6.4 


2.6 


64.7 


1968-1976 


1.5 


13.9 


3.3 


5.1 


1977-1980 


5.5 


7 


1.2 


2 


1981-1988 


7 


0.6 


0.4 


0.6 


1989-1992 


6.2 


1 








1993-2000 


7.6 


2 


1.4 


0.1 



Table 9.8: Antitrust Non Merger Cases per Year in the US 



FTC activity based on discrimination cases was critiqued as incompatible with an- 
titrust enforcement; it was almost abandoned from 1970 on. Horizontal restraints en- 
forcement expanded with a focus on professional associations and trade associations. 
DOJ became involved with criminal prosecution of supplier collusion; the number of 
criminal cases rose from 13 to 28 and 80 per year during the 1960s, 1970s and 1980s, 
before falling back to 60 per year during the last decade (but with a dramatic increase in 
penalties recovered). 



Chapter 10 
Barriers to Entry 



This chapter studies a specific abuse of dominant position, the erection of barriers that 
either directly reduce the degree of competitiveness of a market or impede the socially 
beneficial entry of new challengers. 

We first review the standard theory in use before the development of game theoret- 
ical models of entry barriers. Then, we tackle preemption and foreclosure which are 
strategies of entry deterrence only differing by their timing. We also study strategies of 
predation, signaling and attrition which deal with exclusionary abuse of dominant po- 
sition. Lastly, we present the barriers erected by the State, voluntary or not, under the 
heading of Product Market Regulation. 

10.1 Standard Theory 

Concept 

The notion of barrier to entry is due to Wallace (1936) and was formalized by Bain 
(1956) as "anything that allows incumbent firms to earn above-normal profits without 
the threat of entry". The EC (as well as other antitrust authorities) makes a clear dis- 
tinction between the entry barriers resulting from a particular market structure and 
those resulting from the behavior of incumbent firms. Indeed, the historical presence in 
a market bestow incumbents with comparative advantages that act as natural barriers 
to the entry of new competitors. Those listed by Wilson (2002) are: 

• Scale economies: a significant market share is needed for survival. 

• Cost advantage: accumulation of capital, knowledge or experience. 

• Key input: access at low cost to a crucial input or asset. 

• Cost of capital: entrants must pay a risk premium to financial markets. 

Hence, an incumbent firm has often a perfectly legal dominant position in the market; 
what is unlawful is to abuse it. But why should we expect such a behavior? As we saw 



in §6.1, entry almost always decreases the profits of an individual incumbent firm and 
this effect is even stronger for a monopoly. Anticipating this possibility, incumbents 
may be tempted to strategically erect barriers (or heighten natural barriers) in order to 
discourage entry. 

Examples 

Let us first review some examples of barrier to entry in a market. Preemption is to claim 
or preserve a monopoly position by means other than building a cost advantage. The 
incumbent commits costly actions that irreversibly strengthen its options to later exclude 
competitors. This behavior is not illegal per-se, it is only some of the later behavior that 
might be. 

In 1994, the US-based biotech company Agracetus won a European patent on all 
genetically engineered soybean varieties and seeds and all methods of transformation, 
this despite the opposition of Monsanto and other producers of fertilizers and seeds who 
argued for lack of novelty or inventiveness. After being overruled, Monsanto bought 
Agracetus two years later in a clear strategic preemption of the desired patent. This move 
has successfully insulated Monsanto from competition; in 2001, this company provided 
more than one-half of the soya crop worldwide. Oddly enough, the patent was revoked 
in 2007 for the reasons initially argued against it. 

Other examples of preemption, deemed anti-competitive, are exclusive contracts with 
upstream or downstream partners and clients, brand proliferation, 1@ excessive produc- 
tion capacity, buying patents but not using them (cf §12.2.3), advertising, increasing 
quality beyond the optimal level. 

A frequent concern for the EC is Foreclosure, 2 ® the restriction of market access to 
potential competitors either upstream or downstream by absolute refusal to deal, degra- 
dation of the quality of access, preemption of important sources of raw material supply 
or preemption of distribution channels through exclusivity contracts. Even a govern- 
ment can help to erect a barrier through licensing requirements and other regulations 
that slow the entry of (foreign) competitors. Examples of Foreclosure are historically 
numerous, they deal with access to a stadium, a railroad bridge, a train station, a har- 
bor, a power transmission network, a local telecommunication network, and a computer 
reservation system. 

Bain-Sylos Postulate 

As recorded by Modigliani (1958) in his review of the books by Bain (1956) and Sylos- 
Labini (1957), the small number of active firms in an oligopoly might well be due to 



entry barriers of technological or administrative nature; this is what the Cournot model 
implicitly assumes in that it takes the number of firms as a parameter. Endogeneizing 
entry is made possible by the Bain-Sylos postulate. It is worth noticing that at a time 
where game theory was not available to systematize the study of sequential interactions, 
this postulate enabled to derive meaningful predictions, matching factual observations. 

The Bain-Sylos postulate states that an incumbent won't change his production or 
price after entry. . 3@ This behavioral assumption turns the incumbent into a Stackelberg 
leader (cf. §6.2). The second step of the analysis is to compute the limit price which, 
in some cases, is the minimum of the entrant's average cost. By using that price, the 
incumbent convinces the entrant that she will make losses upon entering the market. 
The entrant being rational and believing the postulate will therefore choose not to enter 
when observing the limit price. Alternatively, a limit quantity would play the same 
signaling role. Notice that the higher the fixed cost of entry, the higher the limit price 
i.e., the easier it is to block entry. 

Now, the incumbent can decide to use or not the limit price. If the fixed cost of entry is 
extremely large, the incumbent can continue to use the monopoly price because it is low 
enough to deter entry; this situation is called "blockaded entry". For a lower fixed cost, 
the incumbent optimally chooses to use the limit price to impede entry. Finally, if the 
fixed cost is low, the best strategy for the incumbent is to allow entry; although blocking 
is still a possibility, it has become too costly. 

Critique 

Although it has proved immensely useful, the Bain-Sylos postulate is seriously flawed as 
it goes against the basic rationality of economic agents; if there is entry of a challenger, 
there is a change in the market structure, in the conditions of competition and therefore 
a new pricing behavior is called for. Stating that such a reaction won't take place is all 
about reputation. 4@ 

When trying to assess the plausibility of this hypothesis, one is lead to wonder how 
costly it may be to build this predatory reputation and also whether there wouldn't be 
an alternative and more profitable strategy. Dixit (1979) is among the first to provide 
a game theoretical analysis of entry where these questions start to be answered. His 
model presented in §10.2.3, uses game theory to assess the cost and benefits of erecting 
anti-competitive barriers in a framework where firms are unable to make non credible 
commitment. 

There is a wealth of examples showing how quickly firms react to entry. Google pro- 
vides us with a series of innovation that have spurred rapid reaction. This company 
unveiled its "desktop search" software on 14/10/04; the same day, AOL announced that 



it would soon launch of a similar engine. On 9/12/04, Yahoo announced it would start its 
own desktop search in January 2005; it was effective on 11/01/05. Finally, MSN, the net 
division of Microsoft launched its own on 13/12/04. The same story went on with Google's 
"1GB email account" and with their "browser search bar". 

In a slightly different register, Microsoft whose new OS was being scheduled for beta 
testing in 2006 advanced the launch of a beta version together with the marketing name 
"Windows Vista" on 22/07/05, only 3 months after Apple's introduction of its "Tiger" OS. 
Considering the number of years between each of Microsoft OS launches (W95, W98, 
XP), this is a short span of time which has been interpreted by specialists as an intent by 
Windows to demonstrate to the general public, that it is too active on the OS technology 
front. 

10.2 Preemption 

In this section, we review several preemption strategies that aim at excluding chal- 
lengers whether they are actual competitors or potential entrants by means of exclusive 
contracts and/or excessive capacity building. 

10.2.1 Exclusive Contracts: Efficiency vs. Exclusion 

An exclusive dealing requires a buyer to purchase products or services for a period of 
time exclusively from one supplier. Prior to the modern era, this practice is understood 
as a well groomed relationship. Its increasing use can be traced to industrialization 
when firms start to specialize and cease carrying the entire chain of production down 
to the final consumer. Prior to the XX f ^, exclusive dealings are seen through a laissez- 
faire lens as partial and thus not unreasonable (cf. rule of reason) if founded on risk 
(supply, price) or protecting investments (cf. §13.3.3). They are routinely upheld in court 
against legal challenges as soon as there existed a viable alternative (i.e, the market is 
not monopolized). 

During the Progressive Era, the dominant view in the US becomes that a market 
leader can impose exclusionary contracts to its own benefit and to the detriment of con- 
sumers by foreclosing the entry of challengers. The Clayton Act (cf. §8.3) of 1914 ad- 
dress this concern by outlawing tying and exclusive dealing arrangements which could 
lessen competition substantially or tend to create a monopoly. Gilbert (2000) notes that, 
although these arrangements are not qualified as illegal per se, no serious test of in- 
fringement is envisioned so that the spectrum of applicability is quite large. Applying 
the new law, courts have stricken down exclusivity contracts as anti-competitive in cases 



involving tying (e.g. §24.1), exclusive dealing (cf. Hollywood movies distribution), and 
long-term contracts. The most common turn-around to this prohibition for a manufac- 
turer are own stores, franchising and the more recent store-within-a-store concept (cf. 
Jerath and Zhang (2010)). 

Starting in the 1950s, the Chicago school offers a pro-competitive defense of exclusive 
dealings (cf. Director and Levi (1956) & Bork (1967)) which has proven influential and 
lead to a greater acceptance of these practices in courts. The case rest on a version of 
the Coase theorem (cf. §2.4.3): contracts are voluntary, not imposed, and must therefore 
maximize the combined benefits of the contracting parties. Contractual terms, apart 
from total price, must therefore be explained as wealth-maximizing, not as the result 
of relative market power or bargaining power. The first welfare theorem would seem to 
follow: if a contract maximizes the combined benefits of signatories, it must be efficient. 
If true, government intervention that limits the set of feasible contracts cannot improve 
welfare; on the contrary, it impedes parties from achieving all the efficiencies at their 
reach. 

The validity of this claim is however limited since it may fails when there are non 
competitive third parties or more generally when we take into account externalities. At 
the same time, simple changes of seemingly innocuous assumptions can sway the conclu- 
sion one way or another so that, ultimately, one must work each case very carefully in or- 
der to make a call as to whether exclusivity contracts are anticompetitive (exclusionary). 
Rasmusen et al. (1991) show that uncoordinated buyers may be "tricked" into accepting 
contracts that, as a group, they would refuse. 5 ® Simpson and Wickelgren (2007) then 
show that the ability to renegotiate contracts brings back efficiency so that exclusive 
dealing cease to be anti-competitive. Next, if buyers are in fact retailers competing in- 
tensively, then exclusive dealing is a vehicle to extract surplus from final consumers for 
the the joint manufacturer-retailer association; this is inefficient, thus anti-competitive. 
Abito and Wright (2008) further show that two-part pricing and/or discriminatory pric- 
ing and/or sequential contracts (divide and conquer), make exclusion cheaper so that the 
manufacturer effectively exclude entry, regardless of the degree of downstream competi- 
tion and any cost advantage of the entrant. 

Modern cases of exclusivity deals suspicious of dampening competition are the man- 
agement of rights for music, football or movies, fees by VISA, licensing schemes by Coca- 
Cola in Europe, Intel, Yahoo, Apple, AT&T or Google. 



10.2.2 Contracts as Barriers to Entry 

In a Take-or-Pay Contract, the buyer agrees to purchase from the seller a fixed quantity 
of a product for a given price over a certain period of time. Irrespective of the quantity 
which is finally needed and transferred, the buyer is bound by its commitment and is 
required to pay for the whole volume of sales at the contractual terms agreed upon. In 
such a contract, there is a transfer price for exchanged units and a (generally lower) 
breach price for non-exchanged units (that were nevertheless agreed upon). Hence, we 
are faced with an option contract whereby an option fee is paid up-front for the right to 
buy at a discounted price in the future. 

Such clauses are often seen as anti-competitive and may be prohibited but it is only 
recently that a formal proof of this assertion was provided. We shall see how exclusivity 
clauses modify competition, and therefore, the entry decision of potential competitors. 

Aghion and Bolton (1987) explore preemption within a sequential model where a chal- 
lenger has the option to enter a market initially controlled by an incumbent monopoly. 
In case of entry, a duopoly competition takes place between the incumbent and the chal- 
lenger. We impose the following rationality condition that is absent from the contesta- 
bility theory or the Bain-Sylos postulate: neither the challenger nor the incumbent can 
credibly pretend to behave in a given manner during the duopoly competition (cf §6.1.7). 
Both anticipate that during the duopoly competition, both will take whatever action is 
optimal at that moment. In other words, you can tie your hands for the future by invest- 
ing heavily into a specialized equipment but you cannot credibly pretend that you will 
wait tomorrow to make such a costly investment. 

Competition without Contracts 

An incumbent (monopoly) seller / can offer a contract to a buyer B at date 1 to impeach 
entry of a competitor E at date 2. Actual exchanges occurs at a later date 3. For an- 
alytical tractability, we define a contract as [p c , d c ) where p c is the agreed price and d c 
the damage 6 ® paid by B to / if B breaches the contract i.e., buy from E instead of I. In 
case of entry, / and E compete in prices at date 3. We shall see that contracts constitute 
a barrier to entry if post-entry profits for the incumbent are lower than the pre-entry 
profits. 

It is public knowledge that the incumbent's cost is c/ = ~, while the entrant's cost ce 
is private information; thus, a contract cannot be contingent on eg. Nevertheless, it is 
known that ce can take any value between and 1. The buyer values the good at 1 so 
that trade is always optimal. The default payoff for E if she does not enter the market is 
set to zero. 



To show that a contract can be useful for the incumbent we first analyze the no- 
contract situation. The game tree is shown on Figure 10.1 below where N stands for 
nature who draws the entrant's cost ce at random in [0; 1] (uniform distribution). 
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Figure 10.1: Entry Sequence 

We analyze this game by backward induction. The buyer chooses to buy if pi < 1, his 
willingness to pay for the good; this gives us the rule to choose among branches (1) and 
(2) on Figure 10.1. The optimal strategy for the incumbent is thus to propose p\ = 1 so 
that the no-entry profits are 7r° Mf = 1 -a - \,n B ut = n E ut = 0. 

If E enters then both firms compete in Bertrand fashion to win the buyer's purchase. 
The winner is the firm with the lowest marginal cost and the price is the highest mar- 
ginal cost max{c/,C£}. If the entrant is the loser (case ce > c/), she earns zero profit and 
therefore chooses optimally not to enter. There is entry only if the entrant has a better 
technology i.e., ce < c/, which leads to profits % l E = a - ce > 0, 7i\ n = and n l B n = 1 - c/. The 
probability of entry is thus (p = Prob{CE < c/) = \. We deduce the expected (before Nature 
chooses ce) profits of the game without contracting: 
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Competition with Contracts 

When a contract {p c ,d c ) between the incumbent and the buyer is legal, the game tree is 
modified as shown on Figure 10.2. The incumbent price p c is now set before the hypothet- 
ical entry of the competitor; thus if the contract is accepted by the buyer, the incumbent 
gains commitment power: he commits not to enter into the price war that was taking 
place in the absence of a contract. Since this effect is potentially bad for its profit, the 
incumbent uses the second instrument, the damage d c , to compensate himself. 




Figure 10.2: Contract as a Barrier to Entry 

From the analysis of the no-contract game we know that the buyer will accept the 
contract only if he expects a profit of \ (or more). Now observe that after accepting 
{p c ,d c ), the buyer either gets 7r° Mf = 1 - p c if there is no entry because the price is fixed or 
n'g > 1 - p c if there is entry because at worst he sticks to the contract. We thus conclude 
that p c < | must hold for the contract to be accepted. 

Since B must pay d c to / if he buys from E, he will do so if and only if ps< p c - d c . 
Given this behavior, the optimal price set by E is p c - d c but it has to be larger than the 
marginal cost to make entry profitable, thus the probability of entry is exactly p c - d c . 
The main difference with the no-contract case is that the entry decision depends only on 
the terms of the contract and not anymore on ci, the technology of the incumbent. 

It remains now for the incumbent to design an optimal contract; when the contract 
iPc,d c ) is accepted the incumbent's profit is 

n I {p c ,d c ) = {p c -dc)dc + {l-pc + d c ){pc-c I ) (10.1) 

The two parts in (10.1) correspond to entry and no-entry of the competitor. As ^ = 
2 (f ~Pc + d c ) and p c < §, we observe that necessarily ^ > 0; this means that the optimal 
price is precisely the limit |. The intuition is that given the commitment power of setting 
the price long in advance, the incumbent would like to raise the price a lot. Yet this would 
make the buyer refuse the contract in the first place, thus the participation constraint 
p c < I will be binding. The incumbent's profit now reads 

ndd c ) = {\-dc)dc + {\ + dc)\ (10.2) 

The optimal damage, maximizing (10.2), solves the FOC § -2d c + \ = Q o d c = \; it 
yields a profit of ^ greater than \, the profit in the absence of contracts. This ultimately 
proves the usefulness of a contract to limit entry and raise profits. 

Observe that, upon signing the equilibrium contract, entry takes place when c e < 
p c - d c = \ which is inefficiently low. For that reason, it may be socially desirable to 



prohibit exclusivity clauses to guarantee a maximal level of competition. Notice finally 
by examining (10.2) with d c = I that the incumbent earns more when entry occurs (^) 
than when it does not (|). This implies that the incumbent will never wish to renegotiate 
the contract making this strategic barrier all the more robust. 

An incumbent seller facing a threat of entry into his market will sign long-term con- 
tracts that sometime prevent the entry of a lower cost producer. Although contract 
are freely entered by the two parties, welfare is reduced on expectation because better 
technologies are frequently barred from replacing obsolete ones. Long-term contracts 
between buyers and sellers can therefore be deemed anti-competitive. 

This conclusion can be summarized with the help of Figure 10.3 where the distribu- 
tion of surpluses makes clear how the incumbent is able to absorb part of the welfare 
addition bring about by the cost innovation of the entrant. 
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Figure 10.3: Contract as a Barrier to Entry 



Exclusion as a Surplus Extraction Vehicle 

Jing and Winter (2010) synthesize the whole debate around the potential anti-competitive 
nature of exclusive dealing as follows: the signatories may end up better off even though 
their deal is inefficient (welfare-reducing) if they succeed to increase their share by a 
large margin (of the welfare). They consider an incumbent manufacturer selling to non 
competing retailers (e.g., local monopolies). To maximize their joint profit, the manu- 
facturer and a retailer minimize their overall cost. When the new technology has a cost 



below the exercise price x, challengers enter and Bertrand competition among them force 
their price towards their cost c. The expected cost is thus C{x) = J* X cdH+ (1 - H(x))ci. The 
optimal exercise price solves C'[x) = and is x = a, so that entry is efficient and not 
blockaded. 

Now, if there is a sole challenger (as in Aghion and Bolton (1987)), he will monopolize 
manufacturing upon entry i.e., he will always sell at x whenever c < x. This makes 
the cost of manufacturing dearer and leads to an inefficient optimal x because the pair 
incumbent-retailer will want to reduce outsourcing wrt. the previous case. Formally, 
the expected cost C[x) = xH{x) + (1 - H[x))ci satisfies C{ci) = Hia) > C'ici) = so that the 
optimal choice is smaller and exclusion ensues. 

Naked exclusion a la Rasmusen et al. (1991) takes into account the minimum scale 
of operation needed to support entry of a challenger when the downstream market is 
populated by many firms or consumers. The incumbent can then exploit buyers' lack 
of coordination to block entry by signing exclusivity contracts with enough downstream 
buyers to shrink the remaining market. The underlying phenomena is that every buyer 
who signs an exclusive contract imposes a negative externality on other buyers. When 
many have already signed or are about to do so, any free buyer realizes that his choice 
will not affect the entry decision of a challenger and can thus be bribed at a very low 
cost by the incumbent. At the limit, the incumbent brings all buyers on board. When 
buyers are able to coordinate, exclusion is profitable only if the amount earned from all 
buyers thanks to monopoly pricing, exceeds the minimum amount that must be paid to 
signatories to exclude entry. 

Analytically, if m retailers out of n are signed with the incumbent, the zero profit 
condition among competing challengers implies = mx + {n-m)p-nc-F i.e., pj = F+ ™ ( \ t ^ lx ■ 
Indeed, the m captive buyers are lured with a price of x (undercutting the incumbent) 
while free buyers enjoy the price war between the two entrants. If this price is above the 
WTP of buyers v (i.e., c > c-i), entry is impossible since the incumbent cannot be profitably 
undercut. On the contrary, if this price is below the incumbent's cost c/ (i.e., c < C\), he 
can't impede entry so that we have 3 regimes according to whether the new technology 
cost c stands. In the intermediate case, entry is blocked by lowering the free buyer's 
price down to pf. The joint surplus of the signatories is m{v - C(x)) + {n - m)n f where n/ 
is the profit made over free buyers and C{x) the expected manufacturing price. Observe 
now that n/ = v{l - H{c 2 )) + f^ p/dH and it is a matter of algebra to check that this is 
decreasing with the exercise price x? @ Lastly, since C = xH{c\) + c/(l - H{c\)) is increasing 
with x, 8@ the joint surplus rises when pushing the exercise price below the incumbent's 
cost which is inefficient, thus anti-competitive. 



10.2.3 Preemption with Capacity Building 

Notable cases of exclusion by means of building an excessive capacity are Alcoa and 
Dupont as recanted by Ghemawat (1984) (sec. II). 

Dixit (1980) is one of the early applications of game theory to industrial organiza- 
tion showing that preemptive (excessive) capacity building can be a deterrence strategy 
successful to block entry. The idea here is quite simple: if you build a large and costly 
capacity you credibly convince potential entrants that competition in the future will be 
fierce; indeed, now that you have a large capacity, there is nothing that can impeach 
you from flooding the market if you want to. Reasoning backward, the challenger under- 
stands that upon entry the market will be flooded, the price will be so low that he will 
not cover his fixed out of his producer's surplus, thus he rationally abstain from entering. 

In this section, we present the model that capture this idea and proceed to demon- 
strate its correctness. The incumbent firm / can enlarge its production capacity kj before 
the potential entrant E decides to enter or not the market (at a fixed cost F). If there 
is entry, firms compete on quantities a la Cournot for the demand Dip) = I- p. There is 
one (best) technology available to both firms; the cost of a unit of capacity is 6 and once a 

{c if q < k 
where 
c + 9 if q > k 

6>6 indicates that investing ex-ante into capacity reduces the ex-post marginal cost. 

Best replies in quantitiesf 

We show that producing beyond capacity does not appear in equilibrium. Indeed the 
profit of firm i (letting j be the other firm) is H t {qi,qj) = qt{\ - qt - qj) - cqt if q% < ki, 
Riiqi,qj) -qdl-qi-qj) -cqi-Oiqi-kt) otherwise. The unconditional argument maximizer 
ofU- is — y— - (the plain line on Figure 10.4 below) while that of n, is — %^- — (the dashed 
line). The best reply to qj, drawn in bold face on Figure 10.4, is — %^~ — over [0;Ar ; ], then 

1 — a — c 

ki and then — j- — over [kf,l]. 

It is worthwhile for firm i to incur high extra production cost (pay 6) to cover the 
market if qj is very low because the marginal benefit is greater than c + 0. At the other 
extreme, if qj is very large, the marginal benefit for i is very low and it is optimal to re- 
treat to a low production, thus his capacity will not be completely used. For intermediate 
qj the optimal response of firm i is to produce up to capacity, no more, no less because 
the price is between c and c + 8. Analytically the best reply is 



BRdqj) = 



— %}- — if </j <max{0, l-c-0-2fc/} i) 

ki otherwise ii) 

1 — Cf — C 

— j- — if qj >max{0, 1 -c-2k{\ Hi) 



Aqi 






l-c 








2 








l-c-e 








2 

ki 








s „ 


\. 1 


— > 



Figure 10.4: Quantity Best Reply 

Cournot Equilibrium and Capacity Reactions 

As can be seen from Figure 10.4 the best reply curves can intersect at most in 3 ways, 
over the first segment (i), the second (ii) or the third (iii). 



i) The solution of \ qi = — %^- — , 



4j 



l-c- 



2 | is q\ = q". = * 3 - . It is an equilibrium only 
if q*<l-c-6-2ki for i = 1,2 thus if k t < ±=%£ for i = 1,2. 



ii) In this middle case, we have q* = kt,q* = kj) it holds when both kt and kj are between 



1 ~ t '- and ^f£. 



iii) The solution of jg ; - = — ^—,qj - — f— -\ is ^* = ^* = ^ together with the necessary 
condition A;, > ^ for i = 1,2. 

Case (i) is valid if capacities are small in which case the profit of firm i is n, = cte- 
®\ 3~ e ~ *-') ~ ^^'- As > 5, rij is increasing with kt thus the best reply of firm i is to 
increase its capacity. Case (iii) is the opposite, it is valid for large capacities and we have 
Ui = cte-8ki, thus the best reply of firm i is to decrease its capacity. 

These two possibilities indicate that in a perfect equilibrium, intermediate capacities 
are chosen i.e., case (ii) applies. The profit of firm i is Ui(ki,kj) = ki(l-ki~kj-c-5), hence 

1— 8—c— k- 

the best reply to kj is ki = ^ — -. In the next part we will use this optimal behavior. 



Blocking vs. Allowing 

Since the incumbent plays first with k\, the challenger optimally enters with k 2 {k\) = 
l ~ 5 ~2~ kl . Its resulting profit is thus n 2 (^i) = \ (1 - 8- c- k\) 2 - F. The incumbent firm can 
make entry non profitable by choosing k\ > k\ CF) = 1 - 6 - c - 2\pF, but this need not be the 
optimal policy especially when k\ is large because building capacity is costly. We analyze 
this choice as a function of F, the fixed cost of entry. 



If F is very large, then k\ (F) = so that the incumbent is an unconstrained monopoly. 
The optimal capacity is k^ = 1 ~ c 2 ~ 5 because, although it is paid in two stages, the total 
marginal cost is c + 8. It yields the monopoly profit fii = |(1 - c-6) 2 . This situation is 
possible if Jfci(F) < fc* oF>F 3 = (1 ~^~ <?)2 . 

If on the other hand F is small, then ki (F) is large which means that the incumbent 
has to build a wasteful large capacity to block entry. Then, having already paid 6ic±{F), 
the relevant marginal cost is c so that the optimal sales are ^ yielding the total profit 
n^F) = J(l - cf - 6h(F). This solution is feasible only if k^F) > ^ o F < F 2 = (1 " c 1 g 2g)2 . 
If F 2 < F < F 3 , the optimal sales are constrained by the capacity and the total profit is 
2Fk x {F). 

With this preliminary information we can proceed to analyze the optimal behavior 
of the incumbent. Given that blocking entry is costly when F < F 2 , it may be optimal to 
accommodate entry. In that case, the price resulting from the Cournot equilibrium is 
p = l-h- k 2 {ki). The profit is thus rii(A:i) = ki (l - /q - k 2 {ki) - c)-6ki = /q 1 - g ~ c - fc i which is 
50% of the monopoly profit. The optimal capacity is thus the monopoly one, A;*, leading to 
an optimal reply k* = k\l2 and equilibrium profits fix/2. It will be optimal to allow entry 
if lii/2 > IIjOF) o F < Fi for another threshold Fi that we assume smaller than F 2 (8 small 
enough). 

In conclusion, the optimal behavior of the incumbent depends on the fixed cost of 
entry: 

• if F < Fi, build the monopoly quantity A;*, allow entry and sell A;*. 

• if Fi < F < F 2 , choose an excessive capacity ki{F) > /cj", block entry and sell ^ < ki{F) 
(some capacity remains idle, there is a waste). 

• if F 2 < F < F 3 , choose an excessive capacity ki{F) > Arj", block entry and sell up to 
capacity. 

• if F > F 3 , choose A;*, block entry at no cost and sell the monopoly quantity. 

To conclude on capacity as a barrier to entry, if the fixed cost of entry is 

very large: the incumbent is a standard monopoly and behaves as such. 

large: the incumbent builds an excessive capacity and uses all of it to block entry. 

intermediate: the incumbent builds an excessive capacity and uses part of it to block 

entry. 
small: the incumbent builds a capacity equal to the monopoly sales, allows entry and 

earns half of the monopoly profit. 



10.3 Foreclosure 

The economic definition of foreclosure puts the emphasis on the denial of proper access 
to an essential facility, often called a bottleneck. According to the EC, the latter is an in- 
frastructure (e.g., national electricity power grid or internal code of an operating system) 
which is necessary for reaching customers and/or enabling competitors to carry on their 
business. It is deemed essential if its duplication is impossible or extremely difficult due 
to technological, physical, geographical, legal or economic constraints. Denying access 
to an essential facility may be considered an abuse of a dominant position by the EC, in 
particular where it prevents competition in a downstream market. 

The question that immediately comes to mind is, why would the owner of an essen- 
tial facility refrain from selling the use of his facility to anyone ready to pay for it? After 
all, he has a monopoly for that very valuable service. To understand the rationality of 
foreclosure we can draw an analogy by seeing a bottleneck owner as a durable-good mo- 
nopolist (cf §4.3.5): once he has contracted with a downstream firm for access to his 
essential facility, he has an incentive to provide access to other firms as well. This is 
an inter-temporal commitment problem where Coase (1972)'s theorem tells us that the 
monopolist does not obtain the monopoly profit because he creates his own competition: 
by selling more of the durable good at some date, he depreciates the value of units sold 
at earlier dates; the prospect of further sales in turn makes early buyers wary of expro- 
priation and makes them reluctant to purchase initially. This is why it is better to limit 
access in the first place. 

10.3.1 Theory 

We follow a simple model from Tirole (1988) where an upstream firm U produces an input 
at marginal cost c; she monopolistically supplies downstream firms A and B. The latter 
produce homogeneous goods using the input on a one-for-one basis and at zero marginal 
cost. The WTP for the final product is P{q). The industry's monopoly profit q{P{q) - c) 
corresponds to the vertical integration of downstream firms with the upstream one. This 
profit is maximum for an output q M , price p M = P{q M ) and yields the maximum profit 
n M . 

The timing among independent firms is as follows: 

1. U offers firms A and B tariffs T A {.) and T B (.). 

2. Each firm orders a quantity of input. Orders are observable. 

3. Firms A and B compete in prices with capacities q A and qs because the transforma- 
tion of the input is time consuming. 



We saw in §5.2 that Bertrand competition with capacity constraints leads both firms 
to name the short term competitive market price P{q A + ^b)', hence the second stage is a 
Cournot competition. Denote BR{q) the quantity best reply of a firm when expecting the 
other to order q (cf. §5.1 on symmetric Cournot competition). To analyze the first stage, 
we need to distinguish whether the tariff offered to one downstream firm is observed by 
the other or not. 

If secret contracts are not available (for either legal or technical reasons) then U can 
get the entire monopoly profit by offering each downstream firm to buy \~ f° r a total 
price t M = \q M p M . Both firm accept, get their zero reservation value, the monopoly 
output is produced and sold. In this world, there is no rationale for foreclosure since the 
upstream monopolist need not exclude any of the competitors. 

The previous offer ceases to be credible if contracts are secret or can be privately 
renegotiated. Indeed, U could offer A a new contract {qA, t A ) that maximizes their joint 

a M 

profit, call it 11a iqA), conditional on B having naively accepted to buy ^-. To see this 
observe that 

n A (q A ) = \p M q M + q A (P [\q M + q A ) - c) 

a M a M 

is independent of the transfer t A and is maximum for q A = BR{^-) > ^-; this optimal 
deviation is the Cournot best reply that an integrated U-A firm would choose when facing 

a M 

an integrated U-B firm selling -^-. Anticipating this future flooding of the consumer 
market, B would refuse the initial offer (which was leaving him zero rent). 

A symmetric offer iq*,t*) by U will form an equilibrium only if downstream firms 
do not fear a profit damaging secret recontracting. Since the best secret deal involves 
the quantity BR{q*), a firm will be welcomed it whenever it is lesser than the original 
q* because this can only increase the final price. The equilibrium constraint is thus 
BR{q*) <q*oq*> q c , the Cournot individual production. The meaning is clear: it is only 
by agreeing publicly to sell much that no downstream firm will fear a secret renegotiation 
aiming at flooding the market. It remains to tailor t * to leave no rent to the downstream 
firms. 

If secret contracts are feasible, the upstream firm cannot refrain from double-crossing 
each of her customers with the other, thereby generating a flooding of the market 
that ends up lowering her profits to the Cournot level. This is why contracting with a 
single downstream firm is optimal; foreclosure is then at work since other downstream 
candidates will face a rebuttal. 



10.3.2 Foreclosure strategies 

We now understand why foreclosure can improve the monopoly's situation in the pres- 
ence of secret contracts. By signing an exclusivity contract with one downstream firm 
the monopoly commits not to sell to the other downstream firms and restores her profits 
to the monopoly level (no secret contracts case). The available strategies to achieve this 
objective are: 

• U can enter into an exclusive agreement with a single downstream firm. 9@ 

• U may integrate with one of the downstream firms. Indeed, by supplying q m to its 
downstream subsidiary the integrated firm earns the monopoly profit and could 
only loose from supplying another downstream firm. 

• U can offer a protection clause to one client to solves the commitment problem. 
Since the production of a firm is observable and the technology is deterministic, his 
purchase of input is also observable, hence U may penalize herself with a large side 
payment to D t if she were to sell more input to Dj. 

• U offers a resale price maintenance (RPM) (minimum price for the downstream 
market) together with a return option for unsold input units (cf. §9.2.2). 

To repeat, even though the upstream firm is in a monopoly position, her inability to 
credibly commit gives room for opportunistic behavior and prevents her from achieving 
the monopoly outcome. Foreclosure is a way to restore profits but it clearly lowers welfare 
since the final production is bound to decrease. If there are more downstream firms, say 
n, the commitment problem is worse since the no-foreclosure equilibrium price tends to 
the marginal cost c as n becomes large. 10@ Thus, the more competitive the downstream 
industry, the more likely that foreclosure by the bottleneck owner takes place. 

Also, it is important to note that foreclosure is more likely to occur the higher the 
bottleneck appears in the production chain. If, contrary to the case presented above, the 
bottleneck owner is a downstream monopoly (in direct contact with customers) and buys 
inputs from competitive upstream firms then she can internalize the negative externality 
between providers and is thus induced to maintain monopoly prices. The reason is that 
she can at the same time extract all producer surplus from upstream firms and charge 
the monopoly price to final consumers. In the former case, the connection between final 
consumers and the monopoly was indirect; any attempt to extract their surplus was 
doomed to fail because the intermediaries, the downstream firms, were opportunistically 
trying to grab a part of it. 

Seen from a welfare point of view, the better case is when the bottleneck is high 
in the production chain. Indeed, if foreclosure can be avoided then lower prices and 
greater production take place. This has lead antitrust authorities to adopt the "common 



carrier" policy 11@ of forcing the bottleneck to operate upstream. For example it is better 
to have a GSM network owner like VODAFONE in Spain establish a telephone company 
and buy access to fixed lines (local loop) from the national monopoly TELEFONICA than 
letting the national monopoly market GSM telephony and buy the network service from 
VODAFONE. The important task is then to make sure that the historical monopoly does 
not foreclose the use of its fixed lines network. It is socially desirable to ensure that the 
most competitive segment of the market has access to final consumers. 
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Figure 10.5: Common Carrier Policy 

Since in practice foreclosure is often assimilated to the discrimination of downstream 
firms, it has been proposed to make discrimination illegal. This is a very short sighted 
response because it gives the monopoly the commitment it most desires. Indeed, if the 
monopoly is forced to offer the same contract to downstream firms A and B, there is no 
way for him to make a secret and different contract with A because B could easily prove 
that the input was provided under different conditions. Hence, the upstream monopoly 
earns the maximum benefit. True, foreclosure has been eliminated but a more inefficient 
outcome has replaced it ! 



10.3.3 Cases and Remedies 

Some historical cases will help the reader to remind that foreclosure can be found in 
many places and settings: 

• The first case dates from 1912 and involves a key bridge across the Mississippi 
River in Saint Louis; it was owned by a set of railroads companies who excluded 



nonmember competitors from using it. The Supreme Court ruled that this prac- 
tice of tying was a violation of the Sherman (antitrust) Act (cf Reiffen and Kleit 
(1990)). The EC J followed the same doctrine for the first time in 1978 against the 
US banana company United Brand for its distribution behavior. 

• Associated Press, a newspapers cooperative was condemned in 1945 for blocking 
membership by competing newspapers. 

• In 1984, the US Civil Aeronautics Board forced the major airlines who had created 
a computer reservation system to give a fair and transparent access to smaller 
airlines. In 1988, the EC fined the Belgian airline Sabena for a similar access (cf. 
Fisher (1999)). 

• In 1989, the UK Monopoly and Mergers Commission considered that "pubs" were 
an essential facility to the access the consumer market and forced the majors brew- 
ers to divest their chains of pubs (cf. Waterson (2010)). 

• In 1991, the ECJ held in the Port of Genoa case that the harbor is an essential 
facility and that its use should not be reserved to the undertaking managing it. 

• For years Intel refused to share technical details of its microprocessor chips with 
companies that build computers unless those companies agreed to turn over their 
own technologies in exchange. The US Federal Trade Commission was seeking 
mandatory non-discriminatory licensing of the data on reasonable terms. A settle- 
ment of an antitrust suit was reached in 1999 one day before the start of formal 
hearings. 

Rey and Tirole (2007) identify 6 set of policies used by competition law practitioners 
to remedy foreclosure: 

1. Structural policies such as divestitures (e.g., UK brewers analyzed by Slade (2004)), 
line of business restrictions or common ownership by users (e.g., Associated press) 
are often considered in last resort, as they may involve substantial transaction 
costs of disentangling activities. Yet, policy makers may come to the conclusions 
that it is hard to design proper rules of access for the integrated bottleneck, and 
that other methods of foreclosure can be prevented under vertical separation. 

2. Access price control: antitrust authorities link the price of access with some mea- 
sure of its cost. The ECJ first made such a comparison in the "United Brand" case. 
As is well known, the measurement of marginal cost is a difficult empirical matter, 
while the allocation of common costs among product lines has weak theoretical un- 
derpinnings. Excluded competitors may be required to prove the overpricing they 
suffered. 



3. Access quantity control whereby a significant share of the bottleneck is to be allo- 
cated to new entrants as in the "Eurotunnel" case. 

4. Price linkages is to use prices for retail goods or other accesses as benchmarks for 
the access price. Examples are the Efficient Component Pricing Rule (ECPR), the 
same pricing rule for all buyers, a single per-unit price or reciprocity rules. 

5. Common Carrier policies turn the vertical structure of the industry upside down 
thereby allowing all actors to access the consumer market (after paying an access 
price to the bottleneck). 

6. Disclosure requirements for supply contracts are intended to promote transparency 
and a greater downstream competition. 

The "Common Carrier" policy has been extended into vertical unbundling whereby 
a vertically integrated utility (in a monopoly position) is ordered to divest the network 
segment (e.g., route, track or canal) which is a natural monopoly and allow competition 
over the service or superstructure segment (e.g., truck, train or boat). 12@ 

10.4 Predation 

In this section, we formalize the strategies that firms can use to signal their strength 
or weakness and how they exercise predation to build a reputation that can be useful in 
the future. The first model explains how an entrant may signal its willingness to avoid 
cutthroat competition while the second shows how an incumbent can signal in advance 
its toughness to avoid later entry. The war of attrition treated in §7.4.2 is an alternative 
vision of economic struggle where the issue is market exit rather than entry. 

10.4.1 Introduction 

Definitions 

Predatory pricing is a deliberate strategy of setting prices below production costs. This 
is prohibited by EU competition law because this behavior has no other economic ra- 
tionale than to eliminate competitors, since it would otherwise be more rational not to 
produce and sell a product that cannot be priced above average variable cost. 13@ From a 
theoretical point of view predatory pricing amounts to build a reputation. 
Signaling (cf. §21.1.3) by an incumbent is to convey information that discourages un- 
profitable entry or survival of competitors. The hallmark is credible communication, 
through the inferences that potential entrants will make from observing costly actions 
either prior to entry in the case of limit pricing or afterwards in the case of limited entry. 



Examples 

NutraSweet, a division of US firm Monsanto marketed the low-calorie sweetener Aspar- 
tame whose patent for Europe, Canada, and Japan expired in 1987. A price war then 
occurred when a new company started to make and sell the paten-free Aspartame in 
Europe. Within two years the price was down by two thirds. The EC later condemned 
NutraSweet for predatory pricing and applied an anti-dumping duty on the imports of 
NutraSweet in the community. 

English firm Laker Airways started as a charter company in the late 1960s and ap- 
plied in 1973 for a permit to fly regularly between the UK and the US which was granted 
in 1977. The product called Skytrain in a direct reference to train travel offered flights 
without reservation and with minimum on-board service at the discount price of one 
third of the regular fare of big airlines. By 1980, Skytrain's market share was 14% but 
Laker went bankrupted in 1982 after competitors met his price. The US liquidator then 
sued several big airlines in US courts for predatory pricing arguing their prices were be- 
low marginal cost and worse, for pressuring the plane maker not to reschedule the debt 
payments due by Laker (in order to force immediate bankruptcy). This case created a 
row between the UK and US government which President Reagan ended by removing 
the case from the US court. The british judicial case was settled out of court in favor of 
Laker. 

A very frequent case of predation is cross-subsidization. Consider firm A which is 
earning handsome benefits in a protected market with not much future and firm B cur- 
rently losing money in a very competitive but promising market. If A and B merge, the 
cash-flow from A's activity today can be used to sustain an aggressive behavior in B's 
market aiming at evicting competitors so that tomorrow the AB company will enjoy a 
large market power in B's market and be able to rip large profits. 

An example of Signaling could be Nestle's intensive advertising for Nescafe; this 
behavior may be interpreted as a signal for toughness since there are only fringe com- 
petitors in most European countries. Likewise, one can interpret as signaling the large 
amount of advertising by firms with large market shares and a very high goodwill such 
as Coca-Cola, Vodafone or Nike. 

10.4.2 Limited Entry 

Gelman and Salop (1983) introduce a neat idea that has proved quite successful in busi- 
ness schools: you can use a larger competitor's strength against him. More precisely 
limited entry is to build a capacity A; small enough to convince the incumbent that it is 
not worthwhile for him to block entry, through a price war because he, too, would forgo 



profits. Let us study the sequential game of entry to ascertain whether this makes sense. 

The entrant offers a price p e and the incumbent respond with p t . Goods are homoge- 
neous, the demand is D = 1 - p and marginal cost is zero for both firms. The incumbent 
can undercut the entrant with a price just below p e , denoted pi = p~, and drive him out 
of the market for the next period (there is always a fixed cost to enter). However the 
payoff with this aggressive strategy is (1 - p e )p e which may be quite low. Thus, it may 
be better to choose pi > p e and anticipate a residual demand l-k-pt since the entrant 
has a small capacity (k < 1 - p e ) and will thus not be able to serve all his demand. The 
profit in that case is (1 -k-pOpt so that the optimal price is easily computed as p* = ^; 
it yields profits of ^-^-. 

The entrant will play on this possibility by choosing A; and p e small enough to con- 
vince the incumbent to let him in. The entrant can thus maximize its profit kp e under 
the constraints p* > p e (undercut the incumbent) and (1 - p e )p e < (1 ~ fc) (leave the incum- 
bent happy). We obtain two conditions on the entrant's capacity that must be satisfied 
simultaneously i.e., 

k < minjl - 2p e , 1 — 2^/(1 - p e )p e \ o k < min j|, 1 — 2^/(1 - p e )Pe\ 

Since the entrant's profit is increasing with capacity, he will choose a value that saturates 
the constraint i.e., A; = 1-2^(1 -p e )p e . The profit of the entrant is thus p e (1-2^/(1 - p e )p e ) 
and is maximum for p* e = \ — h= - 0.15, leading to k* = 1 - -4= - 0.29; the optimal response 
of the incumbent is p* = ^Jf- = -jj= - 0.35 generating sales of 0.36. In percentage of the 
monopoly profit (\), the profits achieved under limited entry are n e = 17% and %i - 50%. 

The entrant builds a small capacity, offers a very low price and earns a third of the 
incumbent's profits; he uses the strength of the incumbent like a Judo contestant uses 
the strength of his adversary. 



10.4.3 Limit Pricing f 

The Bain-Sylos postulate presented in §10.1 lead to the belief by many academics and 
practitioners that an aggressive pricing policy was predatory because it deliberately 
barred the entry of a market to a potential entrant. Since this reasoning is based on 
a hardly satisfied postulate, its truthfulness lies on shaky grounds. In a relatively recent 
work, Milgrom and Roberts (1982) use the asymmetry of information regarding the cost 
structure of the incumbent firm to explain why can aggressive pricing be deemed preda- 
tory. The basic idea is that a low cost firm can initially price very aggressively to signal 
its efficient technology to a potential entrant very much like advertising can signal the 



high quality of a product to consumers (cf. §11.5.2). Both works are applications of the 
signaling theory developed in §21.1.3. 

In the absence of signaling through limit pricing, the challenger would enter the mar- 
ket because, on average, he has a better technology. The incumbent will thus take ad- 
vantage of an initial period of trade where he has a monopoly situation to signal through 
its price that he is a strong incumbent in order to convince the entrant not to enter. 

To develop this idea assume Cournot competition if the challenger enters. Let q equal 
or 2c e with equal probability. Given the demand D{p) = 1 - p, a myopic incumbent would 
produce ^-^ in the first period and earns U m (Ci) = ~ c * . If this behavior credibly reveals 
Ci, then the second period after entry is a game of complete information where firms play 
a Cournot game whose equilibrium is q e = l ~ 2c ^ +Ci and qt = 1 ~ 2 ^ +Ce ; profits are 

U e {a) = (i-^+c/Ki + c +Ci ) and Ui{Ci) = a-2c, + c e m + c e+ c^ 

Now, assuming n e (0) < F < U e {2c e ) where F is the fixed entry cost, the challenger would 
enter against a high cost incumbent but not against a low cost one. Since U e (a) < U m (a) 
for all Ci, no kind of incumbent likes entry, thus the high cost incumbent may try to 
imitate the low cost one by producing ^(0) instead of q\{2c e ). If the challenger believes 
that q\ (0) signals a low cost incumbent and does not enter then the incumbent earns 

(l-q 1 (0)-2c e )q 1 (0) + U m (2c e ) 

instead of 

Yl m (2c e ) + Ud2c e ) 

if he allows entry. We see that there is a cost to block entry because flooding the market 
with the large quantity q\ (0) is costly for the high cost incumbent; but there is also a ben- 
efit of blocking entry to avoid the profit damaging second period competition. Blocking is 
therefore optimal if 

(l-q 1 (0)-2c e )q 1 (0)>U m (2c e ) 

i.e., if ^i(O) is smaller than some threshold c\\. 

Up to now, we have been studying some level 1 of anticipation by the high cost incum- 
bent. Let us pass to level 2: the entrant anticipating this imitation strategy will keep 
thinking that there is a 50% probability that the incumbent is weak because he believes 
that both kind of incumbents are producing the same quantity q\ (0) in period one. If the 
fixed cost F is lesser than ne(0)+ " e(2Ce - ) , then the challenger will enter in the second period. 

Passing at level 3 of mental induction, the strong (low cost) incumbent would like to 
avoid the entry of a challenger that is not able to distinguish the weak from the strong 



incumbent. Hence, the strong incumbent must do something that the weak would not 
imitate: produce so much in the first period (flood the market) that the losses for an 
imitative weak incumbent would be so large that even being a second period monopoly 
would not compensate i.e., producing q\ > c\\. This is the essence of limit pricing. Precise 
computations (left as an exercise to the reader) yield 



1 y72<%-36c e + 5 

q\{c e ) = --c e + . 

z b 

This function is convex (U-shape) with a minimum for c e - 0,35. The profit for the low 
cost incumbent, after having successfully signaled its strength is (1 - q\{c e ))q\(c e ) + n m (0). 

Overall, limit pricing will be profitable if this profit is greater than what he would 
get letting the challenger enter the market i.e., n m (0) + 11/(0). The condition is thus (1 - 
qi{Ce))qi{c e ) > ri/(0) which proves to be equivalent to c e < c e = 17+ 6 3 8 - 0.43 (another exer- 
cise!). 

To conclude, limit pricing can be observed for a large range of entrant cost, even a 
very small one, a case where the challenger and both types of incumbents have very 
similar marginal cost. 

10.5 Industrial Policy and Regulatory Barriers 

Rodrik (2004) recalls that industrial economic activity is ripe with market-failures such 
as informational asymmetries, externalities in human capital or demand, increasing re- 
turns to scale (lumpy investments) and more generally coordination failures. Policies 
coordinating and stimulating specific economic activities or promoting structural change 
are thus a legitimate area of government intervention called industrial policy; it is im- 
plemented with import-substitution, export facilitation, promotion of foreign investment, 
free-trade zones and targeted subsidies. 14 ® 

In the case of industry, there are large spillovers (network externality) because once 
there is proof by one successful entrepreneur that the deed can be done, countless imi- 
tators flock-in and turn the nascent industry into a competitive one where no-one, even 
the first-comer, earns an exceptionally high return on investment. 15 ® Oddly enough, it is 
in sectors with the highest entry barriers (e.g., government regulation) that innovation 
springs up since, only there are entrepreneurs better protected (this even-though the 
initial cost is higher, due to the barriers). This may explain the regulatory barriers upon 
which we report after. 

The first-best policy to foster self-discovery would be to subsidize investments in new, 



non-traditional industries but this is largely a theoretical desire, impossible to imple- 
ment because monitoring is too costly. The situation is somewhat analogous with respect 
to technological externalities that flow from R&D in which case the first-best is an R&D 
subsidy but the real (second-best) policy is patent protection. Note also that an optimal 
subsidy policy requires equating the social marginal cost of investment to the expected 
return of projects in new areas. Hence the successes make up for the failures if, and this 
is the crucial point, these are phased out in time. A good industrial policy is thus carrot- 
and-stick; it needs to encourage investments in non-traditional areas, but also weed out 
projects and investments that fail. A government that makes no mistakes is simply not 
trying hard enough (think of new drug designs). 

Regarding the coordination of investment activities displaying scale economies (so- 
called big push), the problem is social learning i.e., discovering where the information 
and coordination externalities lie and therefore what the objectives of industrial policy 
ought to be and how it is to be targeted. The implementation then require bureaucrats 
to interact deeply with industrialists at the risk of ending up in bed with them. There is 
thus a tension between closeness and independence. 

The State, motivated by security and quality concerns (cf §16.2 & §17.1) and also in 
response to pressure from industries (cf. §16.2.2 on special interest groups) tends to limit 
entry in many activities. As shown by Conway et al. (2005), there still exists a number 
of regulatory impediments to product market competition in areas where technology and 
market conditions make competition viable. 

• State control (SC) is conducive of inefficiency in the absence of market failure because 

competition is likely to be distorted when one player, the government is both judge 
and party. 

• Barriers to entrepreneurship (BE) are all these administrative opaque procedures and 

burdens necessary to create a firm on top of legal barriers to competition. 

• Barriers to commerce (BC) are the tariffs or ownership limits and discriminations ap- 

plied onto foreigners. 

Table 10.1 shows these 3 indexes as well as their average for a variety of OECD 
countries in 2003. Overall, regulatory impediments to product market competition have 
declined in the OECD area in recent years (cf. last column). 
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Poland 


36 


22 


24 


27 


-12 


Italy 


31 


14 


11 


18 
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France 


26 


15 


9 


17 


-7 


Spain 


27 


15 
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16 


-7 


Germany 


22 


15 
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14 


-5 


Sweden 


19 


10 
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12 


-5 


Netherlands 


19 


16 
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13 


-4 


United Kingdom 


17 


7 


3 


9 


-2 


Turkey 


28 


24 


16 


22 


-9 


Mexico 


19 


22 


23 


21 


-2 


Korea 


16 


16 


12 


15 


-9 


Japan 


15 


14 


9 


12 


-6 


Canada 


16 


7 


11 


11 


-7 


United States 


11 


12 


7 


10 


-2 


Australia 


5 


11 


8 


8 


-4 



Table 10.1: Product Market Regulation in the OECD 
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Differentiation and Competition 



In the early XX th century, the discrepancies between empirical observations and the 
predictions of the Marshallian theory of perfect competition were ascribed to exogenous 
frictions until Sraffa (1926) came to argue instead that these frictions were endogenously 
created by firms. Translated in today's economic language, he stated that firms try to 
differentiate themselves from rivals to break the homogeneity of goods that produces the 
deleterious effect of perfect competition. In his own words: The causes of the preference 
shown by any group of buyers for a particular firm are of the most diverse nature, and may 
range from long custom, personal acquaintance, confidence in the quality of the product, 
proximity, knowledge of particular requirements and the possibility of obtaining credit, to 
the reputation of a trade-mark, or sign, or a name with high traditions, ... have for their 
principal purpose that of distinguishing it from the products of other firms. 

Inspired by these insights, the founders of the theory of imperfect competition Hotelling 
(1929) and Robinson (1933) recognize that the own price elasticity of demand is negative 
but not infinite as would predict perfect competition. Thus, when a firm increases her 
price, she starts losing customers, but neither none nor all of them. Secondly, the per- 
fect competition framework is mute about the destination of the customers lost by the 
firm; will they refrain from consuming or buy from a rival? The models of imperfect 
competition we shall develop hereafter seek answers to these questions. 

11.1 Horizontal Differentiation 

A classification of differentiated products will be of great help to address the questions 
asked in the introduction. When some consumers prefer a brand to another equally 
priced while others hold the reverse preference, products are said to be horizontally dif- 
ferentiated (e.g., two identical cars except for color). If, on the contrary, everybody agrees 
on which is the best of two equally priced products, then they are said to be vertically 
differentiated (e.g., a TV for sale in two shops offering warranties of different duration). 



According to the type of differentiation, the consequences on competition differ greatly. 
We treat in turn horizontal and vertical differentiation. 

In this section, we first take a look at the general principles of horizontal differen- 
tiation before treating in detail the selection of locations in Hotelling (1929)'s classical 
model whose study was started in §5.2.2. Then we study how strongly firms are willing 
to differentiation i.e., obtain a cost advantage or a demand one. 

11.1.1 Differentiation Principles 

As will be illustrated afterwards, the literature has tended to argue in favor of the maxi- 
mum differentiation principle, according to which firms maximally differentiate in order 
to relax price competition. It should be noted that the result does not depend on the 
uniform distribution of consumers along the market segment. In the more realistic case 
where some areas are densely populated, firms will try to locate themselves at the epi- 
center of these clusters, as local monopolies. Train or tube stations provide an illustra- 
tion; they are indeed surrounded by numerous shops, but in general one of each kind, as 
theory suggests. 

In §11.2.1, we extend Hotelling's model to capture more issues and explain why his 
intuition was almost right in the following sense: 

I Firms differentiate their products and services along their most importance charac- 
teristic but otherwise choose the same attributes. 

This is entirely consistent with the casual observation that within the shopping area 
of a large city, each street seems to be exclusively devoted to one particular good i.e., there 
is minimum differentiation. Yet, location is a secondary characteristic when compared 
to brand or design or style that define the product and shopping experience offered to 
clients. In that primary dimension, firms seek maximal differentiation. 

To understand intuitively how such an equilibrium occurs let us take a dynamic point 
of view. The reason a particular street ends up specialized in clothes, restaurants or the- 
aters has much to do with the activity of the first shop who had an important commercial 
success in the street. Since the historical shop drains many consumers, new firms settle 
nearby to attract the attention of these potential customers and steal a bit of the incum- 
bent's business. Then, the word of mouth spreads the information that this particular 
street hosts several similar shops and this increases the number of shoppers because 
consumers are certain to find what there are looking for thanks to this extended supply 
(their opportunity cost of passing by is reduced). The virtuous circle is engaged since 
this additional demand will motivate more firms to enter driven by their desire to grab 



a share of the cake. At some point, there is more supply than demand and to avoid a de- 
structive Bertrand competition, the newly entered firms have to differentiate themselves 
through other dimensions like quality or specialization into a segment of consumers, a 
food style, a serving style, etc... One case where this clustering of similar shops is almost 
impossible (or leads to frequent failures) is pure distribution. There are no two adjacent 
supermarkets in the world because they all sell the same brands and can only compete 
on prices "a la Bertrand". 1@ We can conclude that 

I Minimum differentiation e.g., geographical agglomeration, is driven by the expansion 
of demand more than by the desire of firms to steal the business of their competitors. 

Another instance where location is crucial is when prices are regulated (often to pro- 
tect small businesses from undercutting strategies by large firms). Competition then 
takes place only on the location in order to attract the maximal number of potential cus- 
tomers. In the case of two firms, they both end up in the middle of the market. The 
same phenomenon occurs with free Hertzian TV where channels offer almost identical 
program all day long. Likewise, political parties who cannot compete on price terms 
make electoral promises leaning toward the center in an intent to grab the other side's 
traditional clientele (at least in bi-polar countries). 

11.1.2 Competition for Location 

In §5.2.2, we started to study Hotelling (1929)'s classical model of horizontal differen- 
tiation based on geography. Two shops selling the same commodity are located at the 
extremes of a street. We showed their ability to sustain prices higher than their mar- 
ginal cost if consumers have to bear a transportation cost to go from their home to the 
shops. 

We now tackle the intensity of differentiation i.e., the issue of location: is it better 
to move away from one's competitor to relax competition (and charge higher prices) or 
come nearby to capture his clientele? In this process, each firm anticipates the effect her 
positioning will have on the setting of prices afterwards i.e., our equilibrium concept is 
the subgame perfect equilibrium (cf §2.4.2). 

Price competition 

Given the prices quoted in equilibrium, the demands addressed to the firms are given 
by the plain lines of Figure 11.1. Hence, by reducing the address b to b , firm B would 
gain market shares (the utility of B's customers is now given by the dashed line) since 
x shifts leftward to x . This observation lead Hotelling (1929) to formulate the principle 



of minimum differentiation according to which firms competing in prices and product 
attributes tend to supply identical products (choose the same attributes). 2 ® 




a x' x b' b 1 

Figure 11.1: The Hotelling Model of Location 

We modify the model of price competition developed in §5.2.2 by adding an initial 
stage where firms choose their locations a and b inside the street (with the convention 
that a < b). We also use a quadratic transportation cost to facilitate the analysis. Given 
firms locations a and b (cf example on Figure 11.1), the utility of a consumer located at 
x from buying is 



Ua(x) = p- pa~ t{x- a) 2 and Ub(x) = p - ps - t {x - b) 2 

hence the indifferent consumer is located at the address solving ua - ub i.e. 

Pb-Pa a + b 



(11.1) 



x = 
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2t{b-a) 2 
Since tta = (Pa - c)x, we deduce firm A's best reply as 

p A = Uc + p B ) + t{b-a) 



(11.2) 



a+b 



(11.3) 



a+b 
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and from ttb = (Pb - c)(l - x), firm B's best reply as 

p B = \{c + p A ) + t{b-a)\l- 

The equilibrium of the pricing game solving system (11.2-11.3) is 3 * 

f p* A = c + | (2 + a + b) [b - a) 
J p* B = c + | (4 - a - b) {b - a) 



(11.4) 



(11.5) 



Differentiation 

We can now study the incentives for firms to differentiate horizontally. The first stage 
profits of firm A is now a sole function of locations U A {a,b) = {p* A -c)x{p A ,p* B ), thus 



dYl A dU A dp* A ^ (dx dx dp 



'B 



+ {p*-c)\ — + r -±\ (11.6) 

da dp A da \oa dps da I 

Since p* A is a best-reply j^ = (this is the envelope theorem), so that, 

dll A dx dx dp* B 

oc 1 (11.7) 

da da dp B da 

As we already saw in §6 on strategic moves, the overall effect of differentiation is 
the sum of a direct demand effect (first term in (11.7)) and an indirect strategic effect 
(second term) through the change in pricing policy of the opponent. A positive (resp. 
negative) value in (11.7) would motivate firm 1 to reduce (resp. increase) differentiation. 
The demand effect is positive since (11.2) yields || = \ + 2ub-a) 2 > 0; thus the firm ought 
to move to the city center (minimize differentiation) in order to gain market shares. Yet, 
such a move is met by an increased toughness of firm B in the price competition as 
_Pji _ 2t{b-2) < ^yhjgh j n turn lowers profits since ■M- = 2 t{l-a) > °- 

We have here a typical phenomena of economic reasoning where simple intuitive ar- 
guments lead us to opposite conclusions. The only way to disentangle the issue is to 
specify a simple enough model that allow each effect to be computed in order to deter- 
mine which one dominates the other. In the Hotelling model, we can plug the equilibrium 
prices given by system (11.5) into the profit functions 71,4 and ttb to derive the reduced 
form profits as 

U A (a,b) = ±(2 + a + bf(b-a) 

r o (11. 0) 

U B {a,b) = ±(4-a-bf(b-a) 

It is a simple exercise to check that ^- oc {2+a+b){a+b-2) < and likewise ^f > i.e., 
the strategic effect dominates the demand one. These inequalities mean that, meanwhile 
a > 0, firm A prefers to move its location leftward and symmetrically, meanwhile b < 1, 
firm B prefers to move its location rightward. 

I The equilibrium locations choices are simply the boundaries of the city so that the 
"principle of maximum differentiation" holds. 

If new firms can enter sequentially and incumbents can adjust their locations then in 
equilibrium firms are always located at the same distance one from another. If on top, a 
firm can open several shops, it will never sit a new shop next to an old one because this 



would cannibalize existing customers. On the contrary, a firm sits new shops between 
his competitors'. 

Multi-unit Purchase 

Up to now, consumers were only willing to buy one unit of the commodity which fits 
well the case of durable goods. To treat classical retail competition where people buy 
many units of many different goods, it is enough to assume that consumers buy varying 
amounts of a composite good (e.g., that used to construct the CPI). The choice among 
two possible shopping locations is thus driven by the comparison of final utility levels. 
In equation (11.1), it is enough to replace the surplus p-pt by the utility level u{p{) 
derived from buying an optimal quantity at shop i given the (composite good) price pi 
in force there. This way, market shares continue to be determined by the difference in 
transportation cost of the indifferent consumer x. Firms' profits are then the product 
of sales by the unitary profit margin which is the same for all consumers since we are 
implicitly assuming away wealth effect (the transportation cost does not influence the 
amount spend at the shop). 

11.1.3 Oligopoly & the Circular City 

Although appealing, the Hotelling linear city model of competition does not readily ex- 
tend beyond the duopoly. Salop (1979) introduces the unit length circular city where 
firms are located equidistantly on a turnpike. 4 ® As an example, observe that many west- 
ern cities are surrounded by a highway ring (turnpike) next to which sit malls and super- 
markets at more or less the same distance. If consumers are uniformly distributed along 
the ring (and cannot travel inside the disc to reach another point of the ring) then a mo- 
nopolist will locate anywhere on the circle. In real conditions, such an indifference does 
not occur because there is always an accumulation point (or road knot) that is nearer to 
a majority of consumers. Locating at this point reduces overall transportation costs to a 
minimum which enables the monopoly to charge a maximum price for its services. Un- 
der the maximum differentiation principle, an entrant would choose the opposite point of 
the incumbent monopoly. More generally, upon entry or exit, firms readjust their location 
(characteristic) to remain equidistant one from another. 

When n shops are active on the market, they locate at distance 8 = \ from each other 

on the ring. 5@ The westward shop A has a northern neighbor N and southern neighbor 

S; there are thus a northern and southern indifferent consumer, located at distance x N = 

+ 2S^ PA an d %s - + 2St PA f rom A (this is equation (5.15) in Hotelling's model where the 

transportation cost per km t is multiplied by the distance 8). 



If people are uniformly distributed across the ring, a fraction 1 live between A and 
each of his two adjacent neighbors; he thus grabs ^xs from the south side and ^x^ from 
the north side i.e, 

D A = ^x s + x N ) = S -±±^= 1 - + )ip B -p A ) (11.9) 

where ps = PN ^ Ps is the mean price of A's direct neighbors. Firm A's demand is thus the 
fair share of the market, ~, and a bonus proportional to the difference between his price 
and his direct opponent's average (cf eq. 5.15). 

Assuming that all firms share the same zero cost technology, individual profit is tta = 
PaDa oc Pa{Ji + Pb~ Pa), the best reply is easily characterized as 

Pa=1[$ + Pb) (H.10) 

and the symmetric equilibrium is p* = ^. Market size m is measured by the population 
density which was taken to be unitary. It is easy to check from the formulas that ac- 
counting for it amounts to replace the transportation cost t by ^. We can thus confirm 
the intuition according to which 

More competitors increases competition in the sense that the unit margin shrinks. A 
lower density of population or a greater transportation cost enable firms to exercise 
greater market power. 

Tackling the oligopoly case also raises the question of entry; in §6.1.4, we show that 
open access to the market leads to excessive entry which would be reduced if the market 
was cartelized. 

11.1.4 Urban Economics 

Land Rent Distribution 

If Hotelling (1929) can be deemed the father of urban economics, 6 ® geographer von Thii- 
nen (1826) deserves the grandfather title by introducing the spatial dimension to eco- 
nomics, in connection with the classical theory of land rent. This author considers a self 
sufficient "Isolated State", free from external influences, with one centrally located city. 
Land is flat while the soil quality and climate are homogeneous. Farmers transport their 
own goods to the city market across land and act to maximize profits. 

In equilibrium, we should observe four rings of agricultural activity surrounding the 
city. Dairying and intensive farming occur in the ring closest to the city because vegeta- 
bles, fruit and milk must get to market quickly (otherwise they spoil). Timber for build- 
ing and firewood are produced in the second ring because it is very heavy and costly to 



transport. The third ring consists of extensive fields crops because they last longer than 
dairy products and are much lighter to transport than wood, thus they can be located 
further away from the city. Ranching (cattle) is located furthest away because animals 
are self-transporting. Beyond the fourth ring lies the unoccupied wilderness, which is 
too great a distance from the central city for any type of agricultural product to be eco- 
nomically viable. As one gets closer to the city market, the price of land increases and so 
does the productivity of the farming products. This results occurs because farmers bal- 
ance the cost of transportation and the cost of land to decide which activity to perform at 
any location. The most cost-effective product for the city market outbids other options to 
rent the piece of land. 

To understand this result we can imagine that, in line with historical property dis- 
tribution, the land is owned by landlords; they allocate land to the farmers who bid the 
highest rent at each distance x from the center. For each of the farming categories i = 1 
to 4, above let pi be the equilibrium price in the city market, 7 ® let ft be the marginal 
productivity of the activity 8 ® per acre and t\ the transportation cost per unit of distance 
of one unit of product i. A farmer growing type i at distance x from the city will earn 
Ttiix) = fi(pi - Ux) per acre. If we let n denote the subsistence level of a farmer, the maxi- 
mum bid anyone can make for cultivating an acre of land at x with type-z crop is 7ti(x)-n. 
Assuming, in accordance with historical reality, that there are many more farmers than 
landlords, a Bertrand competition occurs among farmers to get the land; hence, in equi- 
librium, they bid their willingness to pay which is nt{x) -n. 9@ As seen on Figure 11.2, 
the upper envelope of the bid-curves determine the equilibrium crops at each distance x 
from the city. 

Dairy products appear first because they have a very high transportation cost (steep 
curve) since they are perishable; although their unit price is not so high they have a 
very high productivity per acre since breeding requires fewer land than anything else. 
The next curve, that of timber, is characterized by high transportation cost and good 
productivity (trees grow towards the sky). Outer rings tend to be larger and larger as 
they correspond to activities requiring a lot of space. Cultivation ends at some distance 
X4 when the lowest economic value of land equals the subsistence level of a farmer R. 
Observe with the dashed bidding curve that crop 5 is not cultivated because the price of 
its output is too low to make it worthwhile anywhere. 10 ® 

City Housing Distribution 

The previous framework is easily extended to modern cities to analyze the distribution 
of income and square space , as shown by Brueckner (1987). 

Households working at the city business district (CBD) in the center, consume hous- 




Figure 11.2: The von Thunen Spatial Equilibrium 



ing q and a composite good z; given the income w and the price p x of land at x, the budget 
constraint is z + p x q < w-tx where t is the transportation cost measuring the opportu- 
nity cost of time lost in transportation. Individual demand is a function of distance x and 
housing price p. The large number of households is then distributed across locations so 
as to equate their final utility level. 11@ This additional equation allows to work out hous- 
ing price p x and consumption q x parametrized by distance to the CBD. Looking back at 



the budget constraint, we have p x = 



w-tx-z 



which can also be interpreted as the WTP or 



bid of the household to live at distance x from the center. 

To learn about the shape of this bidding curve, suppose a person moves away from the 
CBD; she has now higher transportation cost, thus less disposable income and demands 
less of everything including housing, should prices be kept constant. She would be worse 
than before and try to move back. To retain her at this new location, her landlord has 
to reduce the price p x . l2@ Thus the bidding curve is decreasing as in the farming story. 
Also, dwellings away from the CBD are larger since housing is cheaper wrt. other goods, 
so that people consume more of it. 13@ 

If we further assume that housing is a normal good, its demand increases with income 
which means that poorer households have steeper bidding curve. 14@ Re-interpreting Fig- 
ure 11.2, crop labels i = 1....5 become wealth classes, so that in equilibrium, the poorer 
households outbid the richer ones to live nearby the center, although the surface they 
buy is smaller. Rich households occupy greater surface but at greater distance from the 



CBD. 

This model fits quite well with US reality; to account adequately for the rather inverse 
location structure of European Cities, one has to account the higher opportunity cost of 
time of richer households who value the accessibility to the central business district 
(CBD) for work as well as for cultural amenities i.e., t x increases with income x. As 
shown by Brueckner et al. (1999), if the ratio of opportunity cost over housing 4- rises 
with income (amenities really matter), then the rich tend to live at central locations. A 
classic treatment of urban economics is Fujita and Thisse (2002). 

11.2 Location and Variety 

In this section, we show how to relate product location and product variety as instances 
of horizontal differentiation. In the last part we recall the agricultural origin of the 
whole process of geographical differentiation. 

11.2.1 Variety and Opportunity Cost 

Lancaster (1966) formulates an innovative interpretation of horizontal differentiation 
without relation to the original geographical considerations of Hotelling: firms develop 
brands or labels for their products to distinguish them from other brands sold in the 
same market segment i.e., to reduce the extent of competition. In other words, firms en- 
gage in interbrand competition like Danone and Nestle do over yogurts using advertising 
and innovations on taste and recipe. 

Formally, firms differentiate their products and services by selecting differing at- 
tributes or characteristics because they appeal differently to the subjective tastes of 
consumers. In Lancaster (1966)'s approach presented on Figure 11.3, each consumer 
is characterized by an ideal (subjective) combination of attributes; he thus bears an op- 
portunity cost from being forced to buy one of the few brands available on the market 
(e.g., limited choice of colors, shapes or motors for cars). The result of this form of com- 
petition is market segmentation, a segment being the set of consumers who are likely 
to buy a particular variant. On the cost side, it must be noticed that economies of scope 
such as having a single marketing department for all varieties of a given product line, 
are crucial to explain the apparition of varieties. 

One could draw the conclusion that if variants are cheap to design, each firm would 
introduce many of them to occupy all market niches (being nearby clusters of consumers 
on Figure 11.3). But as variants appear there is less and less differentiation in the 
whole market since every consumer is now nearby to several available models; thus 
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Figure 11.3: The Space of Products Characteristics 



there is a tougher competition, not only among firms but also among models of the same 
company. 15 ® One could also wonder whether firms will try to locate their models away 
or nearby their competitors. A concrete example taken from Fraiman et al. (2003) is 
presented on Figure 11.4 for clothing were the brands of the Spanish Inditex company 
are in italics while those of its main contender Cortefiel are underlined (other brands 
appear in regular type). A total of eight characteristics are plotted over the two panels. 
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Figure 11.4: Products Positioning for Apparel 



Another example of development of products along two important characteristics is 
given on Figure 11.5 for the aircraft industry. We also notice from historical events that 
competitors, especially Airbus, tend to introduce close substitutes to existing models 
except for "jumbo jets" where the Boeing 747 stayed 35 years unchallenged before the 



launch of the Airbus A380 in 2004 (cf. Esty and Ghemawat (2002)). 
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Figure 11.5: Aircraft Characteristics along Size and Range 



11.2.2 Multi-dimentional Differentiation 

To be able to answer the question of where is it best to locate in the characteristics space, 
we need a simplification. As we said above, the distance between one's ideal combination 
of characteristics and one of the available model is an opportunity cost. Suppose that the 
design of a car can be ranked according to technical criteria from the definite classic (0) 
to the latest avant-garde (1). Similarly there is a [0; 1] scale for performance axis ranging 
from thrill (0) to security (1). Each consumer values at price f ; the distance between the 
two extremes for each characteristic i. The overall opportunity cost from buying the pair 
of characteristics z = {z\, z-fi instead of his ideal bundle x is then the linear combination 16 ® 
t\{x\ - z\) 2 + t2{X2 - Z2) 2 . To ease our study of multi-dimentional differentiation, the term 
tiixt - Zi) 2 is replaced by (x. - z.) 2 with x. £ [0;v^]. Since attribute #1 is the strong one, 
the space of characteristics is a stretched rectangle as display on Figure 11.6. 

The first result is a reduction of complexity. From the point of view of firms, several at- 
tributes uniformly distributed are akin to one synthetic characteristic with a bell-shaped 
distribution. Consider the left panel of Figure 11.6 where firms' locations are shown by 
black dots. We can draw a new axis associated with a synthetic attribute z whereby firms 



are located at za and zb) we let z = Za+ 2 Zb denote the locations' mid-point. The uniform 
consumer distribution over the rectangle gives rise to a bell-shaped density function, it 
is given at every z by the height of the segment orthogonal to the axis joining the top of 
the box to the bottom (dashed segment). We thus start with /(0) = 0, increase until zq, 
remain constant up to z\ and decrease until /(l) = 0. The associated distribution function 
is thus increasing and S-shaped. 
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Figure 11.6: Two dimensional attribute space 



Let us now study the price equilibrium and the first stage of differentiation. With 
our quadratic opportunity cost, an indifferent customer is such that Pb~ Pa = T.i< 2 ia.i + 
bi -2xi){bi - at). The location of these individuals is a segment orthogonal to the z axis. 
It passes through z when prices are equal and moves in parallel fashion for unequal 
prices. We see on Figure 11.6 that if the differentiation choices satisfy fa 2 ~" 2 < j- then 
the frontier between the two market shares hits the top and bottom of the rectangle; 
this is the likely situation since attribute #1 is the strong one. As shown on the right 
panel, a reduction of differentiation by firm A regarding the weak attribute (at constant 
prices) moves the indifference line (independently of prices) in a manner that makes 
her more competitive towards a large chunk of clients and less competitive towards a 
smaller bunch of clients. This result means that firms will tend to imitation over the 
weak attribute and differentiate in the dominant attribute to relax price competition. 



More precisely, Irmen and Thisse (1998) show that demand is Da 



Pb-Pa+T 



2fi(tri-«x) ES m 

(11.2) with y = t\ {bi -ai){b\ + ai) + t 2 [b 2 - a 2 ) [b 2 + a 2 -l). The price equilibrium is as in (1 1 .5): 
p* = 2t 1 ib 1 -a 1 )+ r = t^r+^2 where v I / i = {bi _ fli)(2 + b 1 + ai) and ¥ 2 = (b 2 - a 2 )(b 2 + a 2 - 1). As 

in (11.8), profit is n* A = %$fTfr 2 zi\ , thus the optimal differentiation for the weak attribute 
a 2 maximizes ¥2 i.e., a* 2 = \- We thus see that differentiation over the weak attribute 
amounts to reduce the sustainable price (^2 < 0) because consumers are more difficult to 
capture which makes competition more intense. Since firm B will do likewise and choose 



b% = \, we have ^2 = so that the profit formula reduces exactly to (11.5) for which we 
already know that a\ should be minimized. 

11.2.3 Monopolistic Competition 

Chamberlin (1933) outlines a theory of indirect oligopoly competition where a firm's de- 
mand is more elastic than the entire market demand. The idea is that brands are dif- 
ferentiated enough for own-price elasticity to be finite, unlike in the perfect competition 
paradigm. Decades later Spence (1976b) and Dixit and Stiglitz (1977) formalize the idea 
with a simple yet powerful specification of preferences. Instead of relying on complex 
Hotelling models where heterogeneous consumers pick a brand among many, homoge- 
neous consumers display a taste for variety and thus consume a bit of all brands. The 
main thrust of this approach is to yield for each firm a demand with constant price elas- 
ticity. 

Competition 

Given a presumably large number of versions of a product (indexed by j), consumers care 

for the composite good y - VLjX B \ where < 6 < 1 is a substitutability parameter. 17 ® 

The representative agent maximizes u{y,x ) under the budget constraint x + T.jPjXj < 

1+T.j Kj where the numeraire x stands for an aggregate of all other goods, / is non capital 

income and Ttj is the profit of firm j. It is fairly easy to prove that the consumer solves 

his optimization problem in two steps. He first arbitrates between xq and y taking the 

"composite" price to be p = \LjP~, with p" = ^. This yields a "composite" demand 

1 
D{p). Then, he optimizes among available versions with Xj = D{p)\j-Y~ B . If there is a 

large number of available versions, then the composite price p is unaffected by pj and 

this demand displays constant elasticity fzg. It is also the cross elasticity of substitution 

between any two versions since ^ = (fH * e • 

Each version of the product is produced with the same technology involving a fixed 
cost F and constant marginal cost c. A key implicit assumption is that when two (or more) 
firms produce the same version of the product, they engage into Bertrand competition 
because the resulting product is homogeneous for consumers. Marginal cost pricing then 
ensues and since fixed cost are present, only one firm can survive (cf §5.2.1); this enables 
us to identify from now on a version with a brand. The constant price elasticity of dj leads 
to the optimal price pj = p* = (1 + p)c (cf. Lerner rule (3.4)). Symmetry then yields p* = 
p* nT? 1 . Firms enter the market while it is profitable i.e., < n = ficq-F^ q>q = j- c . Given 
that the overall expense on the good is £ ; - qjpj = pD{p) and that sales are identical among 



brands, we have q* - p n . , thus the limiting number of firms n* is the integer part of the 
solution to q = q* o Fn l+ P = (3cD ((1 + p)cn~P). If the utility function is Cobb-Douglas with 
u[y,x) = y a x 1 ~ a , then D{p) = ^ at the zero profit equilibrium, so that n* = aI[l ~ 6) . 

Efficiency 

The efficiency of monopolistic competition can now be assessed along the second and 
first best lines. The constrained optimum maximizes utility under a no-loss restriction 
for each active firm. This amounts to minimize the composite price p. By convexity, the 
solution is symmetric with p = pn~P. By duality, the solution is also profit maximization 
under a composite price constraint p <p. On an isocurve p = cte, we have p = ^ = ^ 
and since profit is n = [p - c)^-^- - F, the sign of |^ is that of pnp - {p - c)[p + np) oc 
(1 + p)c- p hence profit reaches a maximum at p = (1 + /3)c. Since the price at the free 
entry equilibrium and at the second best optimum are identical, so are the number of 
active firms (given that none makes a profit). Hence, we may say that monopolistic 
competition is constrained efficient. 

The unconstrained optimum involves pricing at marginal cost and compensating firms 
with lump sum transfers to cover their fixed cost. By convexity, the optimum is symmet- 
ric, thus qi = q,y = qn 1+ P and p = cn~P so that utility is u[l - n{F + cq),qn 1+ P) since total 
cost of production have to be borne. The FOC over n yields ^ q p = ^ but since this 
ratio is equal to p at the consumer optimum, the solution is again q, the equilibrium in- 
dividual sales. From p = c< p* , we deduce p < p* for the same number of firms n* , hence 
total composite demand is y = D{p) > D{p*) = y* which means that more firms must be ac- 
tive (as each sells the same). We then see from the formula for the composite price that 
the difference p - p* is increased. The first-best thus involves the same individual firm 
size but a greater variety (more firms) because the greater sales allowed by marginal 
cost pricing allow to amortize more fixed cost at the economy level. 

It is interesting to contrast this result with that obtained in the Hotelling model of 
§11.1.3 where full market coverage lead to opposite conclusion. The key difference is 
that total demand is not fixed but expands with variety (number of firms) hence allows 
to generate more consumer surplus that can then finance the entry of more firms. 

11.2.4 International Trade 

The modern theory of international trade has borrowed the models of industrial orga- 
nization to overcome the limits of the perfect competition paradigm to study commerce 
among nations. Indeed, in the ubiquitous presence of fixed cost (or scale economies), 
perfect competition leads to marginal cost pricing and overall losses whereas monopoly 



guarantees survival but generates a welfare loss and is a poor match to the reality of 
markets. 

Trade 

Centuries ago, Mercantilists saw international trade as a zero sum game i.e., a situation 
where the gain of one must be the loss of the other. Smith (1776) bailed them out with 
the absolute advantage whereby a country develops an efficient technology to produce a 
good at a lower cost than others. Later on, Ricardo (1817) claimed that a least developed 
country was not doomed to autarchy as it could specialize in the sector where it had a 
comparative advantage, although in the absolute, it would remain an inefficient world 
competitor. 18 ® Trade was then mostly inter-industry and lead to a high degree of spe- 
cialization (e.g., Portugal sells wine to England who sells wool to Italy who sells clothes 
to Portugal). 

With the advent of the industrial revolution and large scale production, advanced 
countries (and regions within countries) started to trade the same goods albeit in ver- 
sions differentiated by quality or specification. This is now the major component of trade. 
Krugman (1979) provides the first formal explanation to this phenomenon with a simple 
application of the monopolistic competition model. 19 ® 

Intra-Industry Trade 

To address the effect of trade on workers' wage, labour is assumed to be the sole pro- 
duction factor with endogenous wage w and exogenous endowment L. We may thus 
introduce parameters y and / to write / = wL, c = jw and F = fw, leading to the opti- 
mal relative price — = (1 + fi)y and limit number of firms n = I(1 ~ g) in the Cobb-Douglas 

W J 

specification. 

The two countries, Home and Abroad, are identical except for labour endowments L 
and L. They have no comparative advantages that would generate inter-industry trade 
yet intra-industry trade will take place. Upon eliminating barriers to trade, the total 
number of brands accessible to consumers rises to n + h which is exactly that correspond- 
ing to the grand economy uniting all the resources of Home and Abroad. Wages can be 
normalized to unity in each country (w = iu = 1) since only the relative price ^ matters. 
Now, we observe that workers have no reason to emigrate, so that production continues 
as before in both countries. Lastly, the equilibrium remains unchanged because (l + /3)y 
remains the optimal relative price for all firms whatever their location. The reason 
why intra-industry trade takes place is now clearcut: since consumers buy all available 
brands, a proportion -^j of Home's income is spend over foreign brands (and vice versa 



for Abroad). The greater number of consumed brands also translate into a greater wel- 
fare. 20 ® Trade volume is jK, thus maximum when countries are of equal size. 

Inter-Industry Trade 

In order to study both inter and intra-industry trade, Krugman (1981) extends the basic 
model as follows. The economy now involves two sectors (e.g., industry and agriculture) 
with the same production technologies (same y) but differing labour endowments (A, 1-A). 
The previous results imply that ^j- = (1 + p)y for all sectors. Since firms earns zero profits, 
total wages in sector i, WiL t are equal to their share of national revenue which, for 
simplicity, we take to be one half (i.e., we set a = 1/2 in the Cobb-Douglas utility function). 
The number of firms in sector i then satisfies ^ = fer. 

Consider then two otherwise identical countries with mirror endowments (A, 1-A) 
and (1 -A, A). By construction, the grand (free trade) economy has the same number of 
firms in each sector, thus wages are equalized across sectors i.e., the scarce factor in 
each country loses from opening up frontiers. As before, consumers in both countries 
consume all available varieties, local and foreign, hence there is trade. Home, being the 
producer of A% of the good #1 varieties, exports this good for a value of A% of Abroad's 
expenditure on the good. A likewise consideration holds for good #2. Now it remains 
to observe that Abroad spends equally on both goods to deduce that Home export are 
(A + 1 - A) ¥1, a constant proportion of GDP. Thus trade volume is constant whatever the 
degree of specialization A (insofar as Abroad is inversely specialized). Obviously, for low 
A, countries are increasingly specialized and make mostly inter-industry trade. 

To appreciate the overall effect of free trade upon worker's utility, we look at the 
equilibrium utility of a Home worker earning w. Two effects are expected. Firstly, there 
is a distribution effect due to factor price equalization, it favors the abundant factor in 
each country but hurts the scarce one (who enjoys a rent under autarchy). Next, the 
trade union aggrandizes the economy thus increases available varieties to the benefit of 
all. The commercial union will be welcomed if Home workers in the scarce sector gain 
from it, which we now study. 

Since the underlying preferences leads to equal expenditure on both sectors (a = \), 
the equilibrium demand for sector i is the fraction 7^— of the consumer's income w. His 
indirect utility is then 

v{w) = }_^\og \riii- r\ =L\og— + p\ogni + cte 

it 2 \ 2n iPi j £ 2 pi 

with /5 = i#. W.lo.g. we set A < \ i.e., the scarce sector is #1. The utility variation for a 



sector #1 worker upon opening frontiers to trade is then 

A,, = KO-^i^logff/^+logff/fJ + ^logl + logl) 

= O + log^j-jSlogCACl-A)) 

= (l-^logA-d + ^logCl-A) (11.11) 

because w{ = w* implies § / f = (f if) g = §±gxg = f = £ = £ (applying * = (i + r 

repetitively). Likewise Al- 2 = log I ^ J - j61og(A(l- A)) > as A < |. In both sector, the 
worker suffers from specialization (first term) but gains from the larger market. For 
highly differentiated versions of basic products i.e., when 6 < \ (or (5 > 1), we have Avi > 0. 

For 6>\, (11.11) has a zero A(/3) = ^ with z = e TT P such that Ai/i > o A > A(j8). 

The model thus confirms that intra-industry trade among similar countries is benefi- 
cial for workers of all sectors whereas inter-industry trade, typical of highly asymmet- 
ric partners puts under duress the workers enjoying the highest wages under autarky. 

11.3 Vertical Differentiation: Quality 

In this section we study how quality, aka vertical differentiation, affects price competi- 
tion among firms. Quality can stand for features of a product such as air conditioning in 
a car but it also applies to services. As a matter of example, consider two supermarkets; 
in the first one, employees are well trained, well dressed and ready to help, shelves are 
nicely decorated. The other supermarket has a reduced number of employees and crude 
shelve presentation but most probably, this negative aspect will be compensated by lower 
prices. This does not prevent some consumers to persist in shopping at the dearest shop, 
simply because they pay more attention to the quality of the service, which compensates 
them for the higher prices. 

11.3.1 Quality and Market Power 

Since consumers have heterogeneous tastes and incomes, the willingness to pay for qual- 
ity differs widely; it therefore makes sense to develop a range of products (based on the 
same elementary commodity) and use differential pricing to maximize profits in this 
market. The example presented in Table 11.1 involves a brand of Spanish wine whose 
price per bottle, quoted in December 2005, varies according to the nurturing (time spend 
in oak barrels): 



Nurturing 


cosecha 


crianza 


reserva 


gran reserva 


Vintage 


2003 


2000 


1998 


1995 


Price (€) 


1,65 


2,65 


4,20 


8,10 



Table 11.1: Wine Qualities 

The theoretical relationships between quality and market power are those obtained 
in §3.3 summarized in Table 3.1. In the rest of the section, we shall concentrate on com- 
petition among firms over a market such as wine and assume that each offers a single 
good which may nevertheless be interpreted as a family of versions. For instance all 
Spanish wine makers offer the nurturings reported in Table 11.1. Yet, they differentiate 
themselves along quality characteristics such as grape origin, use of additives, use of me- 
chanical processing or the reputation of their enologist. In the end, these characteristics 
produce important price differentials. 



11.3.2 Price Competition 

We study Mussa and Rosen (1978)'s model of quality differentiation (cf. §3.3.1) as simpli- 
fied by Tirole (1988) to explain how price differentials justify quality wedges. To simplify, 
each consumers is ready to buy one unit of a durable good. The brand i good has an objec- 
tive quality index st e [0, 1] like those build-up by consumer associations or the specialized 
press for cars, domestic appliances and other durable goods. 21 ® Consumers are charac- 
terized by a "taste for quality" x uniformly distributed in [0; 1] ; their willingness to pay 
for good i takes into account the objective quality in a multiplicative way and is simply 
xst so that the utility of buying good i at price pi is the surplus Ui(x) = xsi - pi; refraining 
from consuming yields a utility normalized to 0. Two firms A and B propose brands of 
(potentially) differing qualities and then compete in prices. As explained in §2.4.2, this 
sequential game is analyzed by backward induction: given the qualities initially chosen 
sa and sb, the price equilibrium is identified, then we go to the first stage to solve the 
quality equilibrium. 

At the price competition stage, one quality being higher than the other, we relabel 
firms as h (high) and / (low); we have Sh > s/. The utility functions u^x) and U/(x) are 
drawn on Figure 11.7. The potential sales of firm i are l-xt where xt is such that 
Ui{Xi) = 0. As can be seen on the drawing, the quality differential advantages firm h. The 
indifferent consumer is x such that Unix) = ui{x) thus x = p J'Z Pl ■ 

s h s l 

The effective demands addressed to firms are D h = max{0, 1-x] and D/ = max{0,x-xi}. 
Notice that if p\ > Ph7~, then x < x\ so that the low quality firm is excluded from the 
market (D/ = 0) in which case the high quality firm becomes a monopoly. The demand 
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Figure 11.7: Utilities under Differing Qualities 



resulting from consumers' choices, given prices, are therefore 



Di(pi,p h ) = < 



1-f^ if pi<p h -s h + si 

p !* s r l~ p l\ h otherwise 



siU h -si) 







if Pi>Phr h 



(11.12) 



and 



D h {p h p h ) = < 



1-^ iip h <^pi 

1 - Ph ~ Pl otherwise 

S h -Si 

if p h >pi + s h -si 



(11.13) 



Whenever pi > ph~, firm / has zero demand and zero profit; she thus has an incentive 
to reduce her price in order to grab a positive market share. Hence, only D^ = 1 - x and 
Di = x-xi are relevant for the equilibrium analysis. 

The payoff functions in that case are 



n h (p h ,pi) = p h \l 



Ph-Pl 

Sh~ s l 



(11.14) 



and 



ni(Ph,Pi) = Pi 



PhSi-pis h 

Sl{S h -Si) 



(11.15) 



The best reply derived from the first order conditions -^- = and 3 = are 



yV)- Sfe "! + P ' and yr l (p h ) = p h Sl 



2s h 



(11.16) 



The price equilibrium solves simultaneously ip h {ph) = Ph and f l {ph) = PV, we obtain 

"< = 7^r and p » = ir p <- (1117) 

Observe that the nearer the qualities, the lower the prices. Hence, choosing a quality 
far away form's one rival increases the equilibrium prices and the resulting profits. We 
may conclude that 

Vertical differentiation enables firms to soften price competition. However the high 
quality firm is a clear winner as he gets to set a price more than twice that of his 
competitor. 



11.3.3 Quality competition 

This first stage where firms choose their qualities is analyzed taking into account that in 
the second stage of price competition, firms play the Nash equilibrium just characterized; 
this is to say we impose subgame perfection. 

Whatever the example we may think of, quality is costly to achieve, thus a good reason 
to lower one's quality is save on costs. To study with precision the other reasons that may 
lead a firm to lower her quality we will assume away the first reason i.e., suppose that 
quality is costless. 

The demands addressed to the firms at the equilibrium prices can be computed using 
(11.17) inside (11.12) and (11.13); we obtain D* = i ^ and D* = i |^. The corresponding 

profits are thus 

* * 4s 2 h {s h -si) 
n h {s h ,sO = p* h D* h = -f ^ (11.18) 

(4s h -si) z 

and 

Yll(s h ,si) = p l D l = =-. (11.19) 

It then remains to consider the first stage of the game where qualities are chosen. 
The profit of firm i = A, B has 3 parts depending on whether products are differentiated 
or not. If identical qualities have been chosen then the two products are homogeneous 
and the price competition is of the Bertrand type leading to zero profit since in the Ber- 
trand equilibrium, firms quote their marginal cost (which is zero here). Otherwise, the 
previous analysis applies taking into account the fact that z"s product might be the high 



or low quality one. We can therefore write 22 ® 



ni{Si,Sj) = < 



Ui{Sj,Si) ifsi<Sj 

ifsi = Sj (11.20) 

U h {Si,Sj) ifsi>Sj 



The reader will easily check that — h gs h ' - > which means that the quality leader 
would like to propose the highest possible quality s = 1, because it boosts her sales and 
price. On the other hand, one can check that u '^" Sl] = for s/ = |s^; this is the optimal 
degree of differentiation for the low quality firm. Lastly, no one wants to match the 
competitor's quality because it would trigger the damaging Bertrand competition. The 
equilibrium of the first stage is thus (si, s 2 ) = (f , 1) or (si, s 2 ) = (1, f ). The high quality brand 
is not known but this is not crucial given that A and B are names of firms without history. 
In equilibrium, the high quality firm quotes a price \ times larger than the low quality 
one and sales twice as much, thereby achieving a seven fold profit (check as an exercise 
that Pi = ji and D\ = ^). We may conclude this analysis by stating 

I Firms are likely to relax price competition (Bertrand paradox) through vertical prod- 
uct differentiation. 



11.4 Drivers of Differentiation f 

11.4.1 Differential Pricing 

Business Stealing and Versioning 

The "punch line" of this chapter is that differentiation is a business stealing vehicle i.e., it 
is a resolute strategy for firms to make distinctive and more attractive products in order 
to capture greater market shares. At the same time, differentiation is also intuitively 
understood as a channel for applying differential pricing in which case, we speak of 
versioning (cf §4.3.4). The idea is to partition a market into segments of homogeneous 
customers, design an appealing version of the basic product for each segment and price 
it at a greater margin. For instance, 

• Perfumes come in two gender-oriented fragrances sharing the same base. 

• Computers and softwares are differentiated across their destined use as in professional 

vs. home. 

• Cars often come with a hatchback, sedan, station-wagon and coupe version. 



• Trains or planes offer classes with increasing levels of quality of service (both on board 

and on ground) to reach segments of users with increasing willingness to pay. 

• Book are sold in hardcover and paperback editions, through readers' clubs and as sur- 

plus books in discount shops. 

The two drivers of differentiation, business stealing and differential pricing, have 
been considered from two radically different perspectives. In chapter 4, firms design 
complex price discriminating strategies but are insulated from competition as we endow 
them with an exogenous degree of market power. In the present chapter, firms directly 
compete but are forced to use simple unit price strategies. 

Differentiation and Discrimination 

We follow Armstrong (2007) and bring together differentiation and discrimination in a 
very simple framework (cf also §5.3.4). Recalling that a monopoly is always willing to 
use price discrimination, we are, in fact, testing the robustness of this conclusion to the 
introduction of direct competition among firms. In the wider interpretation of horizontal 
differentiation on characteristics, the transportation cost represents how much an indi- 
vidual dislikes buying his less preferred brand; we may thus speak of the choosiness. 
Our findings summarize as follows 

I The ability to differentiate prices based on WTP, choosiness and heterogeneity is re- 
spectively useless, useful and harmful. 

Information on WTP Recall that in the standard Hotelling model (without produc- 
tion cost), consumers display horizontal heterogeneity with a taste parameter x £ [0;1] 
while firms are horizontally differentiated with xa = and xb = 1. We can then add 
vertical heterogeneity with choosiness or transportation cost te [to; til- The WTP v for 
the object may be heterogenous but is nevertheless assumed large enough to generate 
a purchase. We assume that taste, choosiness and WTP parameters are statistically 
independent. 

If a monopolist can observe individual WTPs, we already know that he will sell to 
each person at her WTP minus the transport cost. The outcome is efficient and max- 
imizes profit. If now, duopolists observe individual WTPs, they fail to gain anything 
because each buyer takes his decision on the basis of the price difference. The demand 
received by a firm is thus independent of the WTP of purchasers and so is the equilib- 
rium. Observe also that since the information is not used, the same outcome arise in 
equilibrium whether one, both or none of the firm knows the WTPs. Hence this piece of 



information has no value except to a monopolist. This result extends to situations where 
consumers buy multiple units and multiple products meanwhile the location parameter 
remains unknown to the firm for this is exactly why competition makes this information 
irrelevant. 

Information on Choosiness When firms can discriminate over choosiness, they re- 
peat the same basic interaction in all segments. They fight more over people who see 
their products as close substitutes which is detrimental to profits. At the same time, 
firms relax competition over people who are addicted to a specification which is beneficial 
for profits. Overall, firms gain more than they lose when compared to non discriminatory 
pricing as we now proceed to show. 

Firms offer prices pa and pb to consumers of segment t. The indifferent customer is 
x t =\- PA 2 t PB and the equilibrium prices are Pa = Pb = t. Overall, the average price is 
i = E[t]. If firms cannot discriminate, they post a single price valid for all segments. The 
demand addressed to A becomes the expected indifferent customer E[x t ] = \- Pa ^ Pb where 
t = 1/EIy] is the harmonic mean of choosiness. The equilibrium prices are pa = Pb = i <t 
by Jensen inequality (cf. footnote 26.3). This means that discrimination over choosiness 
raises prices and profits. Picky people (high t) end up paying more under discrimination 
and make the extra profit. In the absence of information, firms fail to evaluate precisely 
how more or less choosy people value the price difference so that a discount has a greater 
business stealing effect, thereby intensifying competition. 

Information on Taste The taste parameter x of consumer is the original source of 
differentiation among firms. When they can discriminate over that characteristic, a price 
war ensues to the benefit of consumers and to the detriment of firms. Indeed, each firm 
is now able to extract large chunks of surplus from people close to their offering (in the 
characteristics space). But this means that the strong market is now specific to each firm, 
unlike the case of choosiness discrimination. So, when a firm starts price discriminating 
and lowers the price in her weak market, she is frontally attacking the other side's strong 
market. The lack of agreement on what is the local monopoly for each firm leads to a 
prisoner's dilemma as we now proceed to show. 

If firms can discriminate on taste only, they offer prices p x A and p x B . When choosiness 
t is common, firms compete a la Bertrand for each consumer. A consumer with type x < \ 
shops at A if 

p x A + tx < p x B + r(i - x) o p x A < p x B + r(l - 2x) 

This means that B is undercut without recourse by the price f (1 - 2x) which is thus the 
equilibrium price. Likewise, B takes the other side of the market. Since every consumer 



pays less than the price in the absence of information which is here t-t-t, price dis- 
crimination is beneficial for consumers and detrimental to firms (who lose half of their 
profits as individual profit is f 1/2 t(l - 2x) dx = |). 

To complete the analysis, it is useful to look at the case where choosiness is not com- 
mon to all but, for instance, uniformly distributed over [to; h\. Consider the segment x < \ 
and discriminating prices pa and pg. By pricing at toil ~2x), firm A can exclude B and 
make on average j. If this is a dominant strategy, it will also be for B for x > \, so that 
total profit for each firm is -|, lesser than under no information. It remains to study the 
strategy by which firm A sells dear to choosy people leaving nearby. The price pa attracts 
types such that t > z = v ^-2x hence d,AipA>pB) = j^f and ^ symmetry dsipA^PB) = fzf - - 
The standard Hotelling competition leads to best replies 

p B + h{l-2x) pa- toil -2x) 
Pa = and p B = 

The equilibrium solving this system is 

, i2t 1 -t )il-2x) t ih-2t )il-2x) 
Pa = = and Pb = = 



and leads to profits 



H-2x)i2ti-t f , il-2x)iti-2tof 



%a = and 7r r 

9Ui-fo) 9ih-t ) 

By symmetry for the case x> \, each firm earns £[71^ + ^5] = \ m °_ t r ° - Now, this com- 
petition (and its equilibrium) is valid if the proposed equilibrium price is greater than 
the undercutting strategy which reads 2tl ~ to > t Q i.e., if t is small compared to t\. In the 
limiting case where t vanishes, the equilibrium profit tends to ||- = y > |, the payoff 
under uniform pricing. Hence, firms take advantage of price discrimination over taste 
(x) if choosiness (t) displays a large heterogeneity because segmenting clients by taste 
amounts to engage in Bertrand competition. 

A price discriminating monopolist is never worse off. Likewise, an oligopolistic firm 
is always better off if it can price discriminate, for given prices offered by its rivals. 
However, once account is taken of how rivals will react, firms find themselves trapped 
in a prisoner's dilemma and lose out (cf. §2.4.1). In relation to this theoretical finding, 
many suspect that, at the end of the XIX th century, US railways companies welcomed 
the tariff regulation prohibiting price discrimination precisely because this practice was 
destroying their profits (cf. §9.2.3). 



11.4.2 Cost Edge 

Tyagi (2007) shows in a price competition framework that similar firms would like to 
differentiate more. However, if one firm has a strong cost advantage, it may prefer to 
reduce differentiation in order to take full advantage of its lower cost. 

We use a simplified version of the price system (5.18) describing price competition in 
a differentiated environment with q t = l- p t + d{pj - pt). We set c\ = and c 2 = c so as 
to give firm 1 a cost edge. Profits are ii\ = q\p\ and % 2 = qiiP2 - c). The FOCs of profit 
maximization are 

0=l-2{l + d)p 1 + dp 2 Pl = 2(T^) ^ Pi = (d + 2K2 + 3d) 

o = i + (i + d) c-2(i + d)p 2 + ^ 1 Pa = 1+(1 2 a1g rfpi P2 = Pi + w 

The relevant range for the cost disavantage isp2>coc<c = 4 ^2 +2 ■ The equilib- 
rium profits being nt = (1 + d) {p* - ct) (cf. solution of system (5.18)), we can now inquire 
whether the two firms like or not differentiation. We find out 

oc {d 2 + 4d + 2)c-2-3d = [d 2 + 4d + 2)(c-c)<0 



dd 
dn 



dd 



- oc {l + d){3d 3 + 18d 2 + 20d + 8)c-d{2 + 3df <0o c<c 



where c = (l+d){ ^S + 20d + s) < ~ c since c-coc24d + 24d 2 + 7rf 3 + 8 > 0. 

We conclude that the weak firm always prefers more differentiation (lower substi- 
tuability parameter d) in order to relax the price competition where she stands at a 
disavantage. The strong firm has no such clear cut preference; she displays the same 
preference if the cost advantage is small but otherwise she prefers less differentiation in 
order to take full advantage of her large cost edge. 

11.5 Advertising 

The OECD estimates that member countries spend more than 2% of their GDP in ad- 
vertising. Table 11.2 present estimates in 2003 bn$ from consulting firm Zenith Optime- 
dia regarding advertising expenditure in major media such as newspapers, magazines, 
television, radio, cinema, outdoor, internet. The recent survey on advertising by The 
Economist is an excellent update on the topic. At the individual level, The 2002 report 
on Leading National Advertisers in the US by Advertising Age magazine informs us that 
many companies among which General Motors (cars), Procter and Gamble (detergents), 
Ford (cars), Pepsi (beverage), Pfizer (pharmacy), AOL (media) spend more than 2 bn$ in 



advertising on the US market alone in 2001. 



Region 


2003 


share 


2005 


share 


2007 


share 


North America 


158 


46% 


175 


45.0% 


193 


44.5% 


Europe 


89 


26% 


99 


25.4% 


108 


25.0% 


Asia, Pacific 


70 


20% 


80 


20.6% 


91 


21.0% 


Latin America 


14 


4% 


17 


4.4% 


19 


4.4% 


Africa, M. East 


14 


4% 


18 


4.7% 


22 


5.1% 


World 


346 




388 




434 





Table 11.2: Advertising Expenditure 

These stylized facts make advertising a relevant topic of study. We rely on the exten- 
sive review of Bagwell (2007). 

11.5.1 Opposite Views 

The two basic functions of advertising are to inform and to convince. It has obviously 
no place in a perfectly competitive economy since consumers are perfectly informed and 
firms can sell as much as they want at the market price. Advertising is thus a feature of 
an imperfectly competitive economy. 



Persuasive View 

Advertising clearly speaks to our subjectivity in trying to build loyalty and make us feel 
secure; for instance, shoe manufacturers like Nike or Adidas advertise their footwear 
with famous champions to enable us, buyers, to identify with our idols. 

The negative view, dominant in the first half of the 20th century, holds that advertis- 
ing alters consumers' tastes to create a non genuine brand loyalty. The resulting product 
differentiation makes the demand for the advertised product more inelastic and thus 
enable the firm to sustain higher prices. Since advertising is of a fixed cost nature it 
participate to erect a barrier to entry. Indeed, advertising by established firms creates a 
reputation for their brands and new entrants can succeed only by developing their own 
reputation through even more advertising (they must induce consumers to switch from 
an established and familiar product to a new and unknown one). 

It has been long recognized that advertising enables to steal business from competi- 
tors so that it any campaign will be counteracted by more advertising but since the mar- 
ket remains the same, at least if we consider a loose definition of the market, resources 
spend on advertising are wasted since fail to generate any form of wealth for society. 



The three arguments of fooling consumers, erecting barrier and resource waste add 
up to produce the general conclusion that advertising is anti-competitive. 

Informative View 

According to this positive view, advertising signals the existence of a product and some 
of its competitive attributes like price, retail location or quality attributes. Consumers 
therefore save on the search costs of learning price, quality and the very existence of 
products. When the information revealed is trusted (e.g., local products advertised in 
local newspapers) the degree of differentiation due to misinformation of consumers is 
reduced hence advertising is pro-competitive. 

Secondly, this socially useful information, is made available at a lower cost when pro- 
duced by firms for the simple reason that the mass media permits the diffusion of a single 
message to large audiences at a reasonable price. In the 1960s, Chicago economists like 
Stigler and Becker went further in arguing that advertising was the endogenous mar- 
ket response to the market failure generated by the imperfect consumer information. 
They reason that the advertiser's demand curve becomes more elastic (price reductions 
become more effective business stealing instruments) i.e., promotes competition among 
established firms. As well, advertising facilitates entry as it provides a means through 
which a new entrant can publicize its existence, prices and products. Their conclusion is 
that advertising can be pro-competitive. 

Complementary View 

This last and most recent view tries to implant advertising inside the consumer prefer- 
ences to avoid the ad-hoc formalizations of the old views. Advertising is then seen as a 
complementary good to the advertised product. What the consumer values is the good 
z = g{a)q where q is the product quantity and a the amount of advertising. The positive 
function g captures the character of advertising, either useful (g' > 0) or annoying (g' < 0). 
For example, consumers may value "social prestige", and the consumption of a product 
may generate greater prestige when the product is advertised. An important implication 
is that standard methods may be used to investigate whether advertising is supplied to 
a socially optimal degree, even if it conveys no information. 

It is often infeasible to separately and directly sell advertising to consumers. Instead, 
advertisements is given away like for direct mail ads or sold jointly with the other prod- 
ucts like for a newspaper. The former case may be understood as a situation in which 
advertising is a good (or at least not a bad) that is given away, the quantity of advertising 
is determined by the producers, and each consumer simply accepts (consumes) all of the 



advertising that is received. The latter case corresponds to a situation in which each 
consumer determines his consumption quantity of the joint good, given the price of the 
joint good. As advertising is complementary, it may be sold at a subsidized implicit price. 
Indeed, if advertising is a bad (e.g., TV ads may lower utility), then its implicit price is 
negative (advertisers include free and enjoyable programs to compensate the viewer for 
watching the ads). 

Historical notes 

Early on, Marshall (1890) saw a constructive role for informative advertising but also 
noted that it could be socially wasteful when involving repetitive messages whose pur- 
pose is to steal the customers of competing firms (the resources spent by firms fail to 
generate anything useful for the economy). 

Robinson (1933), among others at the time, pursues more systematically the integra- 
tion of advertising in her theory of monopolistic competition. Even though she does not 
model consumer behavior, her reasoning leads to the conclusion that advertising is both 
persuasive and informative (cf next paragraph) but is overall anti-competitive. 

Kaldor (1950) adds scale economies as an indirect positive effect of advertising. The 
advent of large-scale advertising enables manufacturers to establish brand names and 
leap over wholesalers to establish a direct connection with consumers. The manufactur- 
ing sector then becomes more concentrated, and additional scale economies associated 
with mass-production techniques are realized. The resulting structure is thus charac- 
terized by low production costs and high selling costs (marketing and advertising). This 
theory has some resonance in today's business which we proceed to discuss using exam- 
ples from two sectors. 

Illustrative Cases 

Food It is well know that products displayed on front shelves sell much more. Large 
retailer chains like Carrefour or Ahold exploit this multiplier effect to charge brand name 
producers an extra fee to get these precious locations in their stores. This behavior has 
been deemed anti-competitive in several European countries but it is quite difficult to 
fight. The only way to regain bargaining power with respect to retailers is to be more 
famous than they are in the mind of consumers. An illustration of this is the very large 
amount of advertising on European TV channels made by Danone 23 ® and Nestle 24@ ; 
by spending large resources in advertising both try to reach the degree of visibility of 
the world leader in image, Coca-Cola. The idea is quite simple: a supermarket cannot 
default on Coke without losing instantaneously most of its clients. 



Hotel Since the advent of the internet and communication technologies (ICT), most 
leisure travels are sold through the 4 Computer Reservation Systems (CRS) Sabre, Amadeus 
Galileo and Worldspan that gather offers from dozens of suppliers and dispatch them 
to thousands of travel agencies. Nowadays, with the internet, we are millions to ac- 
cess their databases through the sites of Expedia or Travelocity Hotel companies with 
brand names like Hilton or Sheraton fear to be diluted in the catalogue of these powerful 
aggregators; in that case they would have little bargaining power to negotiate the fee 
they pay to get a good position in the window that the consumer will browse to pick its 
travel. They have thus started intensive campaigns of "branding" to increase the fidelity 
of clients and have them connect directly to the hotel internet site instead of looking for a 
rebate on some wholesaler site. Likewise, airlines companies in the US have joined effort 
and created Orbitz, an internet site selling their own seats at better price and conditions 
than competing CRSs to avoid being squeezed. 

Empirical findings 

It has been shown empirically that brand advertising has a significant effect on the 
brand's current and future sales (goodwill effect) but sales appear more responsive to 
price and quality 25 ® With respect to competition, market share are negatively related to 
rival advertising and an increase in own brand advertising appears to induce rivals to 
respond with more advertising. Overall, the total demand seems responsive to advertis- 
ing. 

Using data on the Coca-Cola and Pepsi-Cola markets over the 1968-86 period, Gasmi 
et al. (1992) show that Coca-Cola was a Stackelberg leader in price and advertising until 
1976. Afterward, competition took the form of duopoly conduct characterized by collusion 
in advertising and possibly price. In this context, the empirical estimates suggest that 
advertising in the cola market is largely combative. 

Kadiyali (1996) analyzes the U.S. photographic film industry. In the 1970s, Kodak had 
a virtual monopoly but accommodated entry by Fuji in 1980. The parameter estimates 
for the pre-entry period (before 1980) indicate that Kodak maintained its monopoly po- 
sition by using limit pricing and high advertising. Estimates for the post-entry period 
suggest that Kodak was compelled to accommodate Fuji who enjoyed demand and cost 
advantages, that Kodak and Fuji then colluded in price and advertising and finally that 
advertising expanded market size and constituted a public good across firms. 



11.5.2 Theories 

Monopoly 

The standard model of monopoly advertising by Dorfman and Steiner (1954) can be pre- 
sented as follows: consumer demand depends not only negatively on price p but also 
positively on advertising expenditures a i.e., q = D{a,p) with D a > 0. We dot inquire here 
into the origin of this effect. 

The monopoly profit is n = pD{a,p) - C{D{a,p)) - a and the FOCs are the traditional 

Lerner(cf. eq. 3.4) equation 

p-C m 1 



P e P 

where e p is the price elasticity of demand (cf eq. (2.14) ) and the novel equation 



(11.21) 



(p-C m )D a = l o _^ = ^ (11.22) 

pD e p 

where e a is the advertising elasticity of demand. The RHS of (11.22) states that a profit- 
maximizing monopolist spends on advertising a proportion of revenue determined by a 
simple elasticity ratio. 

Application to the persuasive view 

Consumers, willing to buy one unit of the good (e.g., a car), are differentiated by their 
basic willingness to pay v. Advertising then acts as a multiplier yielding social prestige 
upon owners; the net WTP is thus vg{a)-p where g(0) = 1 and g' > 0. Assuming a uniform 
distribution of consumers of total mass one, the demand addressed to the firm is D{a, p) = 
1 - -£) since the indifferent consumer has WTP -^. We have §^ = -=X. 

As the persuasive view suggests, when advertising is increased, the demand function 
becomes more inelastic since e p = ( j|_ => -^ < 0. The persuasive view also holds that the 
profit-maximizing price p^f, solving (11.21), rises when the level of advertising is raised. 
To see if this occurs in the present setting, we can study the FOC of price optimality 
[p - C m )j^ + D = which simplifies into 2p^ = g{a) + C m for our particular demand. It 
is then clear that an increase in advertising a, increases the optimal price p^f through 
the effect on social prestige (g' > 0) but nothing can be said outright for the effect of 
technology 26 ® 

If there are diseconomies of scale in production (C m /), an increase in advertising 
increases sales and marginal cost so that the price is increased to preserve the margin 
as measured by the Lerner index (cf. §11.21); the claim is thus correct. If, on the contrary, 
the technology shows strong economies of scale, then the now negative scale effect can 



outbalance the direct advertising effect so that greater advertising leads the monopoly 
to lower his price. To conclude, the model is useful to estimate when the persuasive view 
is correct or not relative to the effect of advertising on price. 

Application to the informative view 

If advertising only informs consumers of the existence of a product, the demand ad- 
dressed to the monopolist is the product of the individual demand d{p) by the informed 
consumer base g{a) where we can assume g' > > g" (decreasing returns to scale of ad- 
vertising). In such a case, the price elasticity of aggregate demand is equal to the price 
elasticity of individual demand (the advertising effect g{a) cancels out in computing e p ). 
Upon examining the Lerner equation (11.21) for the optimal price conditional on ad- 
vertising, we see that the impact of advertising is entirely dictated by the volume or 
scale effect (the quantity in C m ) i.e., the direct effect present in the previous application 
is absent. The effect of advertising is thus straightforward: if there are economies (resp. 
diseconomies) of scale in production then more advertising lowers (resp. raises) the opti- 
mal price but in contrast with the previous application, if the marginal cost is constant, 
then advertising has no effect on the monopoly price. 

Dixit and Norman (1978) provide a foundation for the normative theory of persuasive 
advertising. If the consumer welfare is measured relative to a fixed standard, then a 
monopolist provides price-increasing advertising (as in the first application) to an ex- 
tent that is socially excessive. Proponents of the informative and complementary views, 
however, argue that the fixed-standard approach ignores consumer-welfare gains from 
advertising that are associated with information and social prestige. Under their alter- 
native approach, a monopolist provides price-maintaining and price-decreasing adver- 
tising to an extent that is socially insufficient. In conclusion, the judgment we can hold 
on advertising depends on moral values and on whether "social prestige" should enter 
welfare calculations. 

Duopoly 

In oligopoly markets, advertising is an important instrument of competition. The ad- 
vertising of one firm may steal the business of other firms and thus lowers their profits. 
This business-stealing externality raises the possibility that advertising may be exces- 
sive because once a firms starts spending on advertising, a challenger may be forced to 
spend twice as much to retain or regain his customers. Given the intensification of com- 
petition, it is unclear whether advertising will be socially excessive or not. This issue is 
the earliest example of economic conflict assessed analytically in the literature (cf §7). 



To inquire this quandary we use Grossman and Shapiro (1984)'s version of the Hotelling 
model with extreme locations. The transportation cost is t, the marginal cost of produc- 
tion is c and the advertising technology works as follows: the cost to reach (or inform) 
a proportion x of the street (potential clients) is A{x) = ax 2 12. Let us interpret the pa- 
rameters. The total transportation cost for consumers to learn one price by themselves 
(searching) is / txdx -\t. The cost for a firm to inform everybody of its price is a/2, 
thus the cost for suppliers to inform buyers is a. As we shall see, the equilibrium will 
be highly dependent on which of \t or a is the greatest i.e., who owns the most efficient 
information transmission technology. 

Firms choose prices p\ and p 2 and advertising levels xi and x 2 . Given the potential 
demand X\ addressed to firm 1, x\ (l-x 2 ) consumers are captive as they ignore p 2 , they buy 
if p\ < S, the reservation price. A proportion x\x 2 on the other hand is fully informed and 
solve the classical Hotelling trade-off. Demand is thus D* = xAl- Xj + Xj Pl ^ — f° r J = 1> 2 - 
Profit is then ni = {pt - c)Di - A{xt). The elasticity of demand at p\ = p 2 = p, x\ = x 2 = x is 
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- and is thus increasing with advertising. The FOC of optimal pricing leads to 

P j + t + C 1-Xj 

Pi = ! - L ^ + ~t (11.23) 

2 xj 

where the second term is the mark-up enabled by the lack of information of some con- 
sumers. The FOC of optimal advertising leads to 

l-x i + x 1 Pi ~ Pi+t ) (11.24) 



a \ J It 

Looking for a symmetric equilibrium {p,x), (11.23) simplifies into p = t+c+2t^- while 
(11.24) yields 2x = ^-{2-x). Using the former, the latter further simplifies into 

2x p-c t ( l-x\ 

= -1+2 o x = <1 



2-x a a\ x ) l + V2a/t 

if t < 2a. The equilibrium price is then p* = c + t + 2t^- = c + \/2at > c+t leading to profit 
n = 2a (l + \j2alt) . We observe from these formulas that if t > 2a then competitive forces 
push firms to inform the whole street (x* = 1) and price the good at p* = c+ 1, the complete 
information Hotelling price. This occurs because the information technology of producers 
is much more efficient than the searching technology of consumers. 

In this model, the price is weakly larger than under full information and is increasing 
in t but at a slower rate (over the range t < 2a). The reason is that more differentiation 
makes advertising more valuable for firms thus increases the amount of information 
revealed to consumers in equilibrium; competition is increased thanks to the possibility 



of advertising. Oddly, the equilibrium profit is increasing with a, the cost of revealing 
information; although the direct effect of a is negative, its strategic effect is positive 
and larger. Given our previous analysis this is not so much of a surprise because a 
larger a basically means that firms find it more costly to reveal information, hence they 
reduce their advertising which enables to sustain larger prices. Therefore, a little tax on 
advertising (disguised as a moral requirement) may raise profits for firms and the tax 
collection for the State so that we, consumers, loose on both sides !! 

Signaling Quality 

We argue here following Kihlstrom and Riordan (1984) that advertising can be a signal 
of quality in an oligopoly setting if a high quality producer is patient enough. Since the 
exposition bears considerable similarity with §21.1.3 on "education as a signal of ability", 
the model is only sketched. 

Whenever there is no cheap and reliable agency that can emit a certificate of quality, 
firms are not able to inform correctly their potential customers about the quality of their 
products. For such experience goods, the opinion of consumers is either confirmed or 
revised upward or downward according to what they bought and what they anticipated. 
The following "hit-and-run" strategy immediately comes to mind to take advantage of 
this delay in the revelation of true quality: package a cheap and low quality product as 
if it was top-notch, sell it at a high price for one period and then lower the price since 
nobody will get fool now that the word has spread regarding what was the real quality 
inside the gleaming package. 

If we let n 1 . denotes the profit during one period made by selling quality i when con- 
sumer think it is quality j then we obviously have the ranking n^ > n 1 ^ since cost are 
lower for a lower true quality but also n 1 ^ > n L L since consumers are ready to pay more 
for (what they think is) a higher quality. The present value of aggregated profits over 
many periods for the "hit-and-run" strategy is n^ + \ti l l where r is the interest rate and 
is obviously greater than the profit n L L + \n L L gained by a truthful maker of a low quality 
product. 

To force out the deceptive strategy of "hit-and-run", a high quality producer can sim- 
ply spend an amount of advertising a so large that a low quality producer would not 
want to imitate this behavior for the sake of being see at "HQ". The condition is 

a>n?-7t L L (11.25) 

Yet the high quality producer should take care of not spending too much in advertis- 
ing because he still the option of switching to low quality himself at no cost. This won't 



occur if 

TiH-a+-nH- n L + - n L °" a< { n H~ n L) (11.26) 

A first necessary condition for advertising to be a signal of quality is n 1 ^ > n L L i.e., 
high quality products must generate more profits than low quality ones in a perfectly 
informed market. As we saw in §11.3, the condition is satisfied. Comparing (11.25) and 
(11.26), we derive a second necessary condition for advertising to be a signal of quality: 

n H -7i L 

r < x_ x i.e., the producer is patient and cares mostly for future profits. 

I A high quality producer can signal his characteristic in a discriminatory fashion (avoid 
mimicking by low quality imitators) only if his product has a long life cycle. 



Chapter 12 

Research and Development 



In a market economy, the ultimate ruler is the consumer. Generally, he/she agrees to pay 
more for quality and although higher quality products tend to be more costly to produce, 
they generate higher profits as we showed in §11.3. It is thus rational for a firm to invest 
financial and human resources in order to become a quality leader. This constitutes an 
example of sunk cost because these resources cannot be reoriented without suffering a 
minimal loss (opportunity cost). The subjective traits of "quality" have been studied with 
advertising in the previous chapter. In the present one, we deal with the objective side. 

Innovation according to Schumpeter (1942) includes the entire process of inventing, 
demonstrating, bringing to the market and improving a novel product. It involves two 
complementary activities, invention and melioration, respectively carried by small and 
large firms. Big firms specialize on improving of existing products (e.g., new features, 
increased reliability, enhanced user friendliness, new uses) whereas small firms special- 
ize on discovering new products. Because few inventions have market value and even 
fewer are successfully brought to the market, entrants suffer a rate of failure (exit). In- 
cumbents, on the other hand, follow a routinized and conservative approach to achieve 
strategic objectives such as meeting the R&D effort of competitors. Their bureaucratic 
control of R&D ensures modest, predictable and incremental changes whereas inven- 
tions have the opposite characteristics. We shall not delve further into the intricacies of 
invention and melioration. 

In this chapter, innovation or Research and Development (R&D) stands for all activ- 
ities aimed at creating and improving products and services or at reducing the produc- 
tion and marketing cost; for instance, "Smart" is an innovative 2-doors car created by 
Daimler-Chrysler for city use whereas the "just-in-time" inventory management is a cost 
reducing innovation (cf. OECD for updates on Innovation). 

We shall first discuss the value of innovation for society and how it is protected. We 
then study how and why firms spend in R&D spending in a variety of market structures. 
We assess optimality of using welfare as the measure of social desirability. An important 



theoretical development is the chaining of R&D and standard market competition. We 
also show how R&D can become a strategic device for governments. The last section 
presents a wealth of cases of violations of Intellectual Property Rights (IPRs). 

12.1 Social and Legal Matters 

12.1.1 Social Value 

Health related research, both public and private is obviously beneficial to the human 
kind, directly in terms of better quality of life and longevity but also indirectly by per- 
mitting a longer and more productive working life. Innovations like the Instant Noodles 
or Tetra Pak are desirable for society because they increase consumer satisfaction (and 
welfare). Technological advances such as radio, disc, movie, TV, CD, DVD, PC and the 
internet have change the way we entertain. Other innovations of a more technological 
kind reduce production costs: 

• The natural gas combined cycle is an electricity turbine that doubles the energetic 
efficiency or halves the cost of producing electricity with respect to single fuel tur- 
bines. 

• The invention of dynamite by Alfred Nobel enabled many public works by drasti- 
cally reducing the cost of drilling mountains. 

• The distillation of kerosene from petroleum and later of gasoline made air and road 
transport possible. 1@ 

A few great inventions like penicillin came by surprise but most are the outcome of 
a long process starting with investment in human and physical capital (e.g., brains and 
laboratories); in other words they are very costly. DiMasi et al. (2003) in their study 
of R&D by the US pharmaceutical industry show that less than 1% of the molecules 
examined in the pre-clinical period make it into human testing and of these, only 20 % 
pass the development process and get approved by health authorities. The average R&D 
cost for drugs marketed in the late 1990s is $800 millions. 2 ® 

Since the limited human and physical resources of the economy have many useful 
alternative uses, we are warranted to study whether the levels of R&D expanded by 
private firms are efficient i.e., whether this activity should be supported or discouraged. 
One very important argument in favor of public subsidies for R&D other than health 
related ones is the positive externality generated by R&D upon growth in industrialized 
countries as attested by empirical investigation. 



12.1.2 Legal Background 

R&D is so costly that firms often need to cooperate to finance projects; it is also frequent 
to observe public agencies organize and finance networks of applied research among 
national firms to reach specific industrial goals. At the European level, the EC dedicates 
large amounts to bring forth international teams gathering researchers from the public 
and private sectors (cf European Research Area). The case of fundamental research is 
not treated here; since this activity has no direct market value and is a public good, no 
private firm can find it profitable to finance it, so that financing rests entirely on the 
public power. 

Now, the idea of cooperation being akin to collusion and anti-competitive behavior, we 
might worry that R&D generates more costs in terms of increased market power than 
benefits in terms of innovation. As we saw in §9, collusion may result in a lessening of 
competition and cause negative market effects with respect to prices, output, innovation 
or the variety and quality of products. On the other hand, R&D cooperation can be a 
means to share risk, save costs, pool know-how and launch innovation faster. In partic- 
ular, for small and medium-sized enterprises, cooperation can be of great help to adapt 
themselves to the changing market place, d' Aspremont and Jacquemin (1988) study the 
descriptive as well as normative implications of collaboration in R&D in the presence of 
spill-overs: the fact that the individual research effort of one firm is useful to both firms 
because of a network effect (they belong to the same industry and both interact with 
public research). These authors globally find out that R&D is insufficient with respect 
to the efficient level which would fully internalize the network externality. If the spill- 
over is large enough then allowing cooperation in R&D but not in production increases 
total R&D and total production towards the efficient levels. A full normative assessment 
is more difficult because more cooperation (either none, only in R&D or at both stages) 
generates more R&D but tend to decrease output. 

These arguments together with the positive externality on growth alluded before have 
been recognized by the EU in its competition law. The Horizontal Agreement is an ar- 
rangement between actual or potential competitors, operating at the same level of the 
production or distribution chain. The agreement can cover R&D, production, purchasing 
or commercialization. 3@ A Block Exemption is an authorization by the EC for certain 
types of agreements which are exempted from the prohibition on restrictive agreements 
(cf. §8.2.4). Block exemptions exist, for agreements of vertical integration, R&D, special- 
ization, technology transfer and car distribution. Consortia between shipping companies 
for the joint operation of liner transport services benefit of a block exemption and are 
thus free to restrict competition within the common market and to affect trade between 
member states. 



12.1.3 The Patent System 



Copycats and Free riders 

Consider the example of continuous innovation displayed on Figure 12.1: the first item 
is a traditionnal glass bottle, the second is a plastic bottle introduced in the 1960s, the 
thrid is a svelte plastic bottle designed to facilitate handling (1980s), the fourth is an 
easy to dispose and environmentally friendly model (1990s) and finally we have the body 
shaped model, easy to carry everywhere thanks to the incorporated handle. Each of 
these innovations is readily reproducible by a copycat as it is based on a simple idea that 
improves the service delivered to the consumer. For more complex discoveries, reverse 
engineering is necessary in order to discover how to copy the innovation (cf DVD history 
12.3.4). 4@ 
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Figure 12.1: Easily Imitable Innovations 



Free riding occurs when an entrepreneur builds a large showroom and train her staff 
to demonstrate consumer appliances. Too often, visitors flock into the shop to test the 
products from world known brands but then go to buy at a nearby shop whose business 
strategy is to do without such costly niceties. 

It is crucial to understand that, in the absence of protection for easily duplicable in- 
novations or ideas, Bertrand competition "kills" innovation because the invention, like 
a public good, suffers from free riding. Indeed, when one firm innovates, either through 
a cost reduction or through a new product, other firms promptly copy the innovation so 
that the innovator enjoys monopoly profits for a short period only. Indeed, they soon 
compete on equal foot in a Bertrand fashion and are lead to price nearby their marginal 
cost so that their economic profit is almost nil. Recall that the economic marginal cost 
remunerates the capital employed in production but not the initial investment in R&D 



because it is a sunk cost. Hence, the rational decision of any firm present in that mar- 
ket could well be to wait for someone to sunk the money necessary to come by with the 
innovation and then free ride on the innovator as quickly as possible. In equilibrium, in- 
vestments into R&D are delayed and shrunk, the global pace and intensity of innovation 
is thus severely hampered. 

The iPod success story is a reminder that such a fate is avoidable. The complemen- 
tarity between the iPod hardware and the iTunes software has given Apple an edge that 
even experienced and wealthy competitors like Microsoft have fail to catch up with (cf 
also §24). 5@ 

Legal Protection 

In such circumstances, there seems to be a market failure that keeps R&D from reaching 
an efficient level in the industry because innovation cannot be adequately rewarded. The 
patent system (cf. §12.3.1 for a precise definition) is a simple and quite efficient answer 
to this problem. As soon as a firm discovers a technical device, a molecule for a drug 
or a software code, it rushes to the patent office to apply for patent protection. If the 
application is accepted, the patent holder is bestowed a legal monopoly over the next 
20 years 6@ for the use of the patented item; after that period the product has generally 
become obsolete. 7 ® This patent race displays the "winner gets all" paradox because the 
first to obtain the innovation earns monopoly rents net of her R&D investment while 
the other firms who participated in the race earn negative economic profits. In this race, 
there is no simple (pure strategy) equilibrium because if one firms invests then everyone 
else should either invest nothing (not participate) or invest more in order to be the first 
to discover the innovation (on expectation obviously). 

The long-run dynamic view of the R&D process is that challengers who are barred 
from imitation by the patent protection will try to develop an even better innovation. 
Hence, if she wants to maintain her rents, the patent holder must retain technological 
leadership which means that she has to keep investing and innovating. This idea of 
a contestable market somehow justifies the temporary monopoly granted by the patent 
protection because the continuous stream of innovation greatly increases consumer wel- 
fare. The analysis of this dynamic interaction is beyond the purpose of this book; let us 
nevertheless report the accepted conclusion: 8 ® 

When all potential competitors stand on equal foot, aggregate expenditure on R&D is 
too large for two reinforcing reasons: there are too many firms and each invests too 
much. 

As argued by Posner, shortening the patent term would reduce the rent dissipation 



by reducing the revenue from a patent; it would also reduce the transaction costs of 
licensing, because more inventions would be in the public domain. 

12.2 The Pace of Innovation 

In this section, we study the incentives of firms to spend on R&D in various market 
structures: static monopoly, static competition and potential entry. Finally we address 
the issue of simultaneity in the innovation process. 

12.2.1 Innovation in a static market 

In this section, we assume that the market structure remains unaffected by the innova- 
tion process. 

Monopoly 

Following Arrow (1962), we analyze the incentives of a monopoly to invest into R&D. 
Consider a market for an homogeneous product with demand D{p). If, thanks to an 
innovation, the marginal cost falls from c to c then the consumer surplus increases by 
V* = J c c D{p)dp as indicated by the two striped areas at the bottom of Figure 12.2. 
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Figure 12.2: Replacement Effect 



The value for society of developing the innovation is exactly V* since in a competitive 
market the producer surplus is zero (both before and after the innovation development). 
If, on the other hand, the market is controlled by a monopoly his profit 11 = {p- c)D{p) is 
maximized for p^ 1 equating marginal revenue to marginal cost c. When the marginal cost 
falls from c to c, the profit increases by some amount V M , the grid area on Figure 12.2 
which is clearly smaller than V* . 9@ The intuition is clear-cut: for a small cost variation, 
the value of innovation is proportional to sales, whether the efficient ones D{c) or the 
monopoly ones D(p^f); thus the monopoly is less motivated to invest into R&D because 
his optimal production decision is much smaller. 

A monopoly underinvests into R&D because she values the innovation with respect to 
her limited sales while society values the innovation with respect to the larger efficient 
production. 

Competition 

Arrow (1962) also considers an initially competitive situation; he shows that the incen- 
tives to innovate for a competitive incumbent greatly improve upon the previous mono- 
poly case. More precisely, consider an initial situation where Bertrand competition pre- 
vails, thus forcing all firms to price at their marginal cost c and earn zero (extraordinary) 
profit (cf §5.2). When one firm innovates, her marginal cost falls from c to c and she en- 
joys profit V B = {c-c)D{c) because Bertrand competition among the other firms keeps the 
price at c. As can be checked on Figure 12.2, V B is equal to the vertically striped area 
which is V* minus the small horizontally striped triangle. 10@ 

I Under Bertrand competition, the value of innovation for a competing firm is lesser 
than the social value but much greater than the value to a monopoly. 

The underlying reason for this discrepancy is the differing opportunity cost of a mono- 
poly and competitive firm. In the competitive market, the innovator jumps from zero eco- 
nomic profits to almost monopoly profits while under monopoly, the monopoly is bound 
to replace himself, thus the "replacement effect" coined by Arrow (1962). 

The previous sharp conclusion is severely weakened if firms compete in quantity. 
Indeed, for the linear demand D{p) = a-bp, we can use the Cournot model with asym- 
metric cost of §5.1.2 to assess the innovation value. The pre-innovation efficient output 
is Q* = a - be, thus the value for society is roughly V* = {c-g)Q* (and half of this for a 
monopolist as she sells one half). In the symmetric pre-innovation duopoly, each firm 
earns § - 22% of the welfare. The innovation bestows its holder a cost advantage which 



turns into a higher profit because she sells more and enjoys lower cost. The innovation 
value, measured by the profit difference, simplifies into V c = | [a-bc){c-c). 

Imitation 

Protecting one's innovation from imitation is of the essence for a firm. Indeed, if all firms 
can adopt it in a regime of Bertrand competition, profits remain zero before and after 
which ultimately means that the innovation value if zero. Under the softer Cournot 
regime, the industry takes advantage of the innovation to sell more and earn more, thus 
the innovation value to a firm is her share of industry profit. In a symmetric duopoly, a 
firm earns § = 22% of the welfare so that the individual value V c = |V* is 22% of the social 
one, not a large figure but still more than zero. Using the innovation value when the 
implicit patent considered above, we can compute the WTP for the patent V c - v9 m = v9 m 
i.e., about 22% of the social value of the innovation. 

A corollary of these results is that the first welfare theorem does not apply wrt. inno- 
vation. Even when we start from the ideal situation of perfect competition (Bertrand), 
the incentives to innovate are lower than would be socially optimal. The reason is that 
innovation being costly, it must be protected by a patent to avoid imitation (and its dele- 
terious effect). But, once the patent is in force, the competitive market gives way to a 
less efficient monopolistic competition. Indeed, either the innovation decreases the pro- 
duction cost of the patent holder and he can translate it into market power (our model) 
or the innovation allows a greater differentiation (either in quality or characteristics). 
In both cases, the competitiveness in the downward market is reduced. 

Search 

Another dimension through which a competitive market structure fosters more innova- 
tion than a monopolized one is the trial and error process aka. the search for profitable 
innovations. Indeed, in a competitive environment, external industry wide chocks are 
filtered out when comparing profitability with a challenger. An oligopolistic firm thus 
gets better signals regarding whether an innovation is working or not. The monopoly, 
on the other hand, lacks this benchmarking or yardstick ability. Since a firm active in a 
competitive environment is better able to figure out the usefulness of a new strategy or 
technology, she is more willing to try them. We thus observe more and faster innovation 
in competitive industries. As a side result, we obtain an additional reason why a compet- 
itive or oligopolistic industry is more efficient than a monopoly (beyond the deadweight 
loss seen in §3.2). 



12.2.2 Patent Licensing 

Our plan in this section is first to motivate licensing, then examine basic strategies such 
as royalties and license fees, compare them, combine them, look at the outsider/insider 
distinction, compare different kinds of royalties and finally deal with the inherent com- 
mitment problem of the patentee. 

Introduction 

In an era of global competition, firms cannot rely solely on their own R&D, they must 
plan the external sourcing of technology. At the same time, licensing their own findings 
becomes an additional source of revenue (cf Zuniga and Guellec (2009)). We shall mostly 
deal with an independent laboratory that comes up with a cost reducing innovation, 
successfully patents it and designs an optimal licensing policy towards the members of 
the industry where the innovation can be used. An example would be the Dolby sound 
system for movie theaters. The basic strategies at the disposal of the innovator are 
selling licenses for a fixed fee, auction a limited number of licenses, set a per-unit or 
ad-valorem royalty or combine these basic strategies. The case where the innovator is 
an industry incumbent, aka an insider, is analyzed later on. 

The ideal strategy for the patentee would be to sell one license, have the licensee 
expel the rest of the industry thanks to its drastic efficiency improvement, let him set 
the monopoly price and earn the monopoly profit (associated with the new technology) 
and then siphon this payoff by way of the upfront license fee. In any model of oligopolistic 
competition for the end product, this happens only if the new monopoly price is below 
the old marginal cost of the industry i.e., when the innovation is drastic. 

Aside from this rather unfrequent case, the innovator is bound to earn less from her 
discovery. When considering basic strategies as exclusive alternatives Kamien and Tau- 
man (2002) show that, in the absence of uncertainty, the auction prevails over fees and 
royalties. The empirical prevalence of royalties can then be explained by issues such as 
asymmetric information, uncertain quality of innovation, product differentiation, moral 
hazard, risk aversion, leadership structure or strategic delegation. 

Royalties 

The royalty system is the most popular mean of licensing an innovation. Yet, from a 
theoretical point of view, it is a tax that inefficiently reduces the total output sold by 
firms, thus reduces the industry profit and ultimately the payment the innovator can 
extract from the industry. This claim, as well as the comparison with fees and auction, 
is developed within a model of oligopolistic Cournot competition. 



We consider a fixed market structure with n firms sharing the same technology char- 
acterized by marginal cost c. The innovation allows to reduce cost by 5. When the in- 
novation is licensed through a (per-unit) royalty r, a licensee achieves a per-unit saving 
5 - r. Thus, he is willing to license if and only if r < 8 as he can still compete with the old 
technology. Under such a condition, all firms license and the new Cournot equilibrium 
is q r = fl ~^~ r . The patentee thus earns n = nrq r , a quadratic expression in r whose 
unconstrained maximizer (cf §1.4.3) is easily found to be r* = a ~ c 2 +s . If 6 >8 = a-c, we 
have a drastic innovation and r* < 8 i.e., the ideal royalty can be used. For non drastic 
innovations (the usual case), the optimal royalty is the maximum one, the cost saving 6. 
Notice that the price and consumers surplus remains the same. Welfare improves by the 
total cost saving but firms fails to take advantage of it since it all goes to the patentee 
who earns n^ = -^ 6 {a-c). 

Per-Capita vs. Ad- Valorem As recalled by Vishwasrao (2007) royalties are more of- 
ten than not based on the revenue generated by the product incorporating the innova- 
tion, as opposed to being based on the number of units sold. Boccard (2010c) show that for 
both an outsider and an insider, ad-valorem royalties are preferred because they make 
licensing revenues increasing with the price which reduces the degree of competition in 
the market. This allows an outsider to extract more profit and an insider to strategically 
reduce her aggressiveness so as to sustain a higher price, a lower output and ultimately 
higher profits. 11@ 

Fees & Auction 

Under fee or auctions, any number m of firms can end up owning a license. Using eq. 
(5.14), the equilibrium with coexistence of licensed (L) and non licensed (N) firms is 
qN,m = a ~n+\' 5 f° r a non li censee an d qL,m = qN,m + 8 for a licensee. Observe that for 
m>fh = ^p, non licensed firms are expelled from the market as their output is driven to 
zero. 12@ Let us study first the fee and auction regime under non exclusion i.e., over the 
domain m< m. 

Under a license fee regime, a potential acquirer must compare her equilibrium payoff 
in two situations: either she is one of m licensees and earn q\ m or she stays out and earns 
q 2 N m _ 1 given that only m - 1 firms have the license. The WTP for a license, conditional on 
m being bought in equilibrium, is thus 

v m = ql m -q 2 N ,m-i (X (a-c + (n + l- m)5) 2 -(a-c-(m- l)8) 2 = 26 2 n[ 2m+ 2 n+2 - m) (12.1) 
Upon varying v m the patentee can attract exactly m licensees and earn n oc m ( 2m + w+2 - rn), 



thus the optimal number is m F = 2ot + w+2 (insofar as it is between n and m for otherwise 
the optimal value is whichever bound is hit first (cf §1.4.3).). 13@ 

When the patentee auctions m licenses, a potential bidder reasons that if he stays out, 
there will still be m licensees 14 ® and he will earn less than when there are m-\ licensees 
as above (i.e., qN,m-i > qN,m)- His WTP for the license is thus the greater q 2 L m - q 2 N m oc 
m + n Y L - m and the optimal number of licenses is m A = 2m + w+1 < m F (under the same 
proviso as above). Regarding profits, we have n F = m F \q 2 Ljn - q 2 N m _^ < rn F [q 2 Lm - q 2 Njn j 
because qN,m-\ > qN,m- Now, the latter expression is less than n A by the very fact that 
m A is an optimal choice. We have thus shown that auctioning dominates fees. 

When the patentee decides to exclude non licensed firms from the market by setting 
m>fh, competition only takes place between advanced firms (with cost c - 6), hence the 
more numerous the new industry is, the lower the industry profits, and the lower the 
patentee's revenue. It is thus never optimal to sell more than fh licenses. The truly 
optimal number of licenses is thus m A if m A <fho n<2m-l and m otherwise. 

Lastly, observe that selling m licenses (with a fee or an auction), we obtain q N = 0, 
qL = 8, and Q = fhS = a-c, thus p = c, as if there was perfect competition with the old 
technology. The patentee thus earns n = mq 2 L = 8{a- c) > n^ . This particular example 
shows why an ex-ante method is superior: it allows to reduce the degree of competitive- 
ness of the market, thus sustain a higher profit that is siphoned through fees or auction. 
This completes the proof of our claim that n A >n F >jiQ. 

Optimal Combination 

Once we allow the combination of basic licensing strategies, the optimal policy is to 
cartelize the industry using royalties with the entire industry as a price raising in- 
strument in order to diffuse competition and allow large profits that are siphoned back 
through license fees. 

Giebe and Wolfstetter (2008) study the optimal combination of royalties and auction 
for an outside laboratory. The optimal policy is to set high royalty to all so as to sustain 
a high equilibrium price and near cartel profit and then siphon part of this with the 
auctioning of licenses. A strategy for the patentee is thus to auction m licenses and set a 
royalty rate r L for licensees and a larger rate r N for non-licensees. 

As we already saw for the pure royalty case, 8>rm must hold in order to be effective; 
next we must have rAr > ri for otherwise no one would bid for a license. 15 ® Letting Rt = rtqt 
be the royalties paid by a type-i firm, operating profit is %i = ni + R t = {p - c + 8)qi. The 
patentee profit is 

n = m{r L q L + n L -n N ) + {n-m)r N q N = m{7t L -fi N ) + nr N q N (12.2) 



= mnL + {n-m)jiN-n7iN = ^Q- nn^ (12.3) 

where 7To is the industry operating profit, gross of any payment to the patentee. Notice 
that the formula is correct even when the patentee excludes some firms from the market. 
At this point, we notice that n L -n N in (12.2) is exactly (12.1) which means that the 
patentee will make more profit than by auctioning licenses alone thanks to the royalties 
paid by non licensees. Royalties paid by licensed firms are only useful to tune the end 
market cost conditions suitably. 

Meanwhile p < c in equilibrium, non licensed firms are crowded out and the equi- 
librium is symmetric among licensees. By (5.12), raising tl reduces total output but 
(unexpectedly) raises 7io because output is closer, though still larger, to the monopoly one 
when the innovation is not drastic. When r L reaches 8, all firms are paying the royalty 
5, thus have the same "old" marginal cost c i.e., p > c must hold in equilibrium meaning 
that no firm is being excluded under the optimal policy. 

Having proved that all firms are active, we see that raising r^v towards its maximum 
5 increases c, thus simultaneously increases industry operating profit 7r (cf. argument 
above) while decreasing obsolete profit n N by (5.14). Since this change unequivocally 
increases n Q , we must have r^ = 5. The patentee profit thus simplifies into n Q = Q{a - 
c + 6-Q)- nq 2 N with Q = mq^ + {n- m)q^ being market output. Working out (5.14) with 
cat = c and d- c-6 + Tl, we obtain q^ = ^f-p and Q = -^{a -c) + -^ where z = m{6 - Tl) 
so that 7r is a function of z only i.e., the optimal scheme is indeterminate. 16@ The FOC 
for optimality is 

Q J^l^Q«-Q^ = 2n qN a -^-o mM - Q K- 2 Jm 

dz dz dz n + l n + l 

=> Q M = Q- n q N = {a-c + zln-a + c + z) = z (12.4) 

n + l 

Since the patentee sets r L < 6, we have c < c i.e., the industry average cost falls; 
this means a greater output and a greater welfare (as payments to the patentee do not 
matter). It might even be the case that the royalty for a licensee is negative i.e., firms 
are subsidized to use the innovation in the end market. Indeed rj, < o {n - 1)5 < Q M o 
5 < ;r-% i.e., the innovation is small. 

2n-3 ' 

Insider Licensing 

We now study the optimal licensing policy of a patentee that is also an incumbent of the 
industry where the innovation is used. An example would be Sony with the Blu-Ray 
technology. Kamien and Tauman (2002) show that for a large oligopoly or equivalently 



for a large innovation, there is a reversal in the sense that royalties now dominate auc- 
tioning. 

The revenue for an incumbent of auctioning licenses does not change wrt. the outsider 
case when n is large. Indeed, the optimal number of licenses is m A = m which amounts 
to exclude non licensees from the end market. The price paid for the license is thus the 
operating profit of an incumbent firm (since her outside option is now zero). This means 
that one earns likewise as an insider getting a free license or as an outsider auctioning 
the license. From, q^ = 0, we deduce qi = qN+8 = 8, thus the auctioning profit is nf = mq\. 

The royalty scheme must be studied anew when the patentee is also an incumbent. 
We already know that a higher rate r increases her cost advantage over the rest of the 
industry, thus raises her output qi while lowering that of challengers q^; this means 
that her total profit nf - q\+{n- l)rqN unequivocally rises, thereby making r = 8 opti- 
mal. Using (5.14), we compute q^ = a ~^" 5 and qN = qL~8 = a ~ n X\ > thus the incumbent 
innovator earns nf = +m n ^. 2 n ~ + > nf o {m-l) 2 > 0. This proves the claim that 

royalties dominate the auction for n or 8 large. 

Regarding incentives to invest into R&D, an outsider having a zero opportunity cost 
has always greater incentives than an insider who already earn a Cournot profit. 

Commitment 

The licensing of the patented technology is prone to the hold-up problem (cf §14.2). 
Having granted n licenses, the patent holder is always tempted to sell further licenses 
(at a lower fee) which thereby depreciates the value of the existing n licenses since the 
original licensees now face more competition than expected. 17@ 

Such an expropriation is ex-post profitable for the licensor, but reduces his ex-ante 
profit. Indeed, if many downstream firms hold a license, intense competition destroys 
their profits, thus lowers their ability to pay the license fee in the first place. Therefore, 
a patent holder would like to promise that he will emit a limited number of licenses. 18 ® 

A similar point can be made for the franchising of a brand. Franchisees are unlikely 
to pay much to franchisors if they do not have the guarantee that competitors will not 
set shop at their doorsteps. The phenomenon at play here is lack of commitment similar 
to that of the durable good monopolist who cannot refrain from lowering its price as time 
passes (cf. §4.3.5). A solution to excessive licensing might be to integrate vertically with 
a firm competing in the product market. 19 ® Let us compute the optimal strategy between 
licensing to all competitors and integrating with one of them in a simple model. 

Imagine that an independent laboratory has develop a method to reduce the marginal 
cost of production from c to 0. If the patent holder integrates with one downstream firm, 
she achieves a cost advantage; in that case, she will not be tempted anymore to sell 



additional licenses to firms that compete with her. If on the other hand, she does not 
integrate and starts to sell patents then, as we argued previously, she will sell licenses 
to all n competing firms present in the market. Thus, Cournot competition takes place 
with different cost structures. 

We consider the simples demand is D = 1 - p. In the "all licensing" case, the Cournot 
equilibrium profit is n c = , n l 1)2 (cf. §5.1.3). The patent holder can therefore asks a fee 
F = n c and nets a total profit n llc = " 2 (this is a two part tariff rent extraction). The 
patentee would be better off issuing a single license if she could commit to it. If not, then 
she shall license to all n competitors in the downstream market. In the "exclusive licens- 
ing" case, the patent holder integrates with one firm and thereby achieve the necessary 
commitment. From that moment on, it is rational for her to refuse to license to others 
because this would lower her future profits in the downstream market. Competition thus 
involves an asymmetry of costs i.e., a unique leader and many followers. Applying the 
general result of eq. (5.14), we have q = l+[ "+\ )c for the licensee and q = ^f f° r the other 
firms, thus the aggregate quantity is Q = q + {n - Y)q = "~^"~ 1)c . Observing that the equi- 
librium price is q, the profit of the leader is simply n mt = q 2 = (1+(W ~H C) • The comparison 
we were looking for is n llc > n int o n < n = l^r) . To conclude: 

I Licensing dominates integration only when there is much market power but it is less 
likely to happen if the innovation is drastic (large cost reduction). 



12.2.3 Innovation Race 

Arrow (1962)'s results are criticizable because they fail to account for the Schumpeterian 
process of creative-destruction whereby old and inefficient incumbent monopolies are 
displaced by young and innovative entrants. 

Entry Threat 

Once we consider the threat of entry, the story change dramatically. The simple analysis 
of Gilbert and Newbery (1982) shows that the willingness to spend on R&D of a mono- 
poly is greater because he stands to lose much; a strategic analysis even reinforces this 
conclusion with hints towards an anticompetitive behavior of preemption. 

Consider an incumbent monopoly (he) and a challenger (she) ready to enter the mar- 
ket. If the monopoly innovates first, he will patent the innovation and most likely bar 
the challenger from entering. Indeed, the latter has a high cost technology, thus upon 
entering she will either earn zero profits if there is Bertrand competition or a low profit if 



there is Cournot competition. If her sunk cost of entry is greater than this level, entry is 
barred. The monopoly therefore enjoys the large profit n^ after having innovated first. 

The interesting case is when the challenger is first to innovates; she will then enter 
because she has a cost advantage. If there is Cournot competition, profits are respec- 
tively n9 and rig for the incumbent and the entrant with n9 < rig. We can now assess the 
value of being first to innovate (and patent). This value is V 1 = n^ - n9 for the monopoly 
and V e = n^ for the challenger. We saw in our analysis of oligopoly that producer surplus 
(aka industry profits) decreases with the number of active firms i.e., Ilg + uf < n^. We 
can thus conclude 20 @ that 

I The value of innovation is larger for the monopoly than for the challenger once we 
account for entry in the innovation process. 

How does the greater willingness to spend on R&D of the incumbent relative to the 
challenger impinges on their respective behaviors? It is clear that the more you spend on 
R&D, the quicker you will develop the innovation, yet the connection need not be linear 
because luck has a role to play. It is nevertheless useful to start with a deterministic 
relationship to immediately see that the incumbent will preempt the innovation. Indeed, 
the challenger won't spend more than V e on it, 21@ while the incumbent's own limit is the 
greater V 1 , thus he can surely beat her by spending slightly more and this leave him 
with a net profit of V 1 - V e = Yl^f - Yl c - Ilg which is the loss in producer surplus generated 
by entry 22 ® 

Policy Implications 

From an efficiency point of view, the innovation will take place at the right time (as if 
a competitive challenger would innovate) but since the monopoly will have succeeded to 
block entry, there is a welfare loss due to his pricing conduct. 

The policy interpretation of this simple result is that the monopoly may try to erect 
a strategic barrier to entry by over-investing into R&D to secure the best technology and 
accumulate patents; this attitude is popularly known as building a "thicket" or "bucket" 
of patents. The broader reading of this preemptive behavior reads as follows: if en- 
try occurs, the large technological advantage of the incumbent will force the entrant to 
differentiate either to a low quality or to compete with a higher marginal cost. As we 
previously saw in the corresponding models, the entrant's economic profit is low in both 
cases. This means she won't be able to recover her sunk cost of entry (her own R&D in- 
vestment), thus she will rationally abstain from entering, thereby leaving the incumbent 
free to price at monopoly levels and enjoy monopoly rents. 



Observe also that developing a new technology (not a cost reduction as in our model) 
or acquiring a patent does not necessarily mean that the monopoly will bring it to the 
market; in other words, a patent might be "sleeping". The innovation comes to the mar- 
ket with certainty only when entry actually takes place for otherwise it might be cheaper 
or more profitable to continue exploiting the good-old fashioned technology. The strategic 
barrier to entry we have just commented here is fully legal although it is inefficient since 
resources are wasted (from a social point of view) to protect a monopoly position. 

12.2.4 Market Rivalry 

In this section, we show how investments into a better technology (lower cost or better 
product) impinge on the market competition and how the intensity of the later feeds back 
the incentives to invest in the first place. 

Why innovate? 

We explain in §3.3.1 on quality that a consumer's WTP for an item depends on its per- 
ceived quality, whether a good or a service. It is furthermore shown that product devel- 
opment such as market research, innovative R&D, or advertising is akin to a reduction 
of the production cost (of the item under consideration). This cost reduction can proceed 
through many avenues such as improving management or customer care or simply up- 
dating the production technology, either by purchasing an advanced (patented) technol- 
ogy or through in-house R&D. In any case, the cost reduction activity displays decreasing 
returns to scale. 23@ 

We thus use "R&D investment" as a generic name for all the possible strategies that 
firms may use to reduce their cost or improve their product quality. Our aim is to study 
the determination of the optimal level of R&D and embed this issue in models of market 
competition to understand how the intensity and mode of rivalry affect the choices of 
individual firms. 

The direct effect on profits of a marginal cost saving is proportional to current output 
i.e., from n = {p-c)q we deduce A7T = qhc. This technological melioration also makes the 
firm look tough (hurts other firms), whatever the competition mode. The cost saving can 
also display a strategic effect if there exists a relationship between the equilibrium quan- 
tity q* and the marginal cost c. This is not the case under monopoly, perfect competition 
or price-setting regulation because there is no strategic interaction between firms and 
thus no indirect effect of a cost reduction. When the innovator competes in an oligopoly 
framework, there is a strategic effect because competitors respond to the firm's change 
of technology. As shown in §6.2.4 on business strategies, quantities are strategic substi- 



tutes while prices are strategic complements and since the cost saving makes the firm 
tougher, the strategic effect is positive in the Cournot case but negative in the Bertrand 
one. 24@ We show below that this dichotomy remains true though weakened. Cournot 
competition reinforces the natural willingness to spend on R&D by 33% in duopoly and 
up to 100% in large oligopolies while Bertrand competition reduces it by 33% in duopoly 
but no more than 15% in large oligopolies. 

The one technical assumption common to the various model exposed below regards 
R&D and is borrowed from Brander and Spencer (1983). We assume existence of a patent 
free technology whose constant marginal cost is c. To reduce her marginal cost by some 
amount x, a firm invests into R&D an amount y/{x) = Xx 2 for some parameter A that 
might be firm dependent. This way the cost reduction activity displays DRS. 

Quantity competition 

Given R&D effort jc,-, marginal cost is c\ = c-xt for i = 1,2. As we saw in §5.1.2 on Cournot 
duopoly competition, the profit of firm i is m = \q\ where the equilibrium quantity is 
(5.10) 

qi = -{a-2bci + bcj) = - [a - be + 2bxt - bxj) (12.5) 

The marginal benefit of innovation is thus 

dni 2 dq t 4 4 .„ 

^— = T^i^— = o<7* = -ia-bc + 2bxi-bxj) (12.6) 

0Xi b dxi 3 9 J 

which is 33% greater than the direct effect qi as we claimed above. For an oligopoly with 
n firms, it is enough to observe that the | coefficient in the duopoly equation (5.10) is 
replaced by -^ of (5.14) in the oligopoly case. Thus coefficient | in equation (12.6) is 
replaced by ^-. In a large oligopoly, the ratio tends to 2 meaning that firms have an 
indirect investment effect equal to 100% of the direct one(cf §26.1.5 of the appendix for 
the complete analysis). 

To simplify algebra in the duopoly case, we introduce parameter at = ^A* -2 in the 
R&D cost function; this way y/ t {Xi) = §(cr; +2)£>x? and the marginal expense of reducing 
the production cost is thus -^ = |(cr;+2)foxj. 

We can now characterize the optimal R&D investment by solving the FOC 

dm dy/ t 



dxt dxt 



o a-bc + 2bxt-bxj = (cr ; -+2)£>x; (12.7) 



The best reply of firm i against the choice Xj of her rival is xt = — g— — -. This best reply 
makes sense only if at > i.e., A ; is large enough. We observe that the strategic substi- 



tutability of quantities under Cournot competition translates into the R&D stage: the 
more my opponent invest to reduce his cost, the less I will invest myself (cf §6.2.4) . 
Given the symmetric formula for firm j, the equilibrium of the R&D game can be com- 
puted as 

a— he a i — 1 

(12.8) 
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This result makes sense only if the a parameters are both greater or both lesser than 
unity (equivalently that the A's are large) i.e., cost reduction are either hard or easy to 
achieve for both firms, which we shall assume. We notice that equilibrium R&D increases 
with market size (a) but decrease with the elasticity of demand (b). 

For identical R&D technologies (at = Uj - a), we obtain x = S^rk , thus q = ^^f^f > 
q c = 2 -y £ , the output under no innovation at all. We compute and n = \q 2 - 2 l {a + 2)bx 2 = 

{a-bcf {a+2)a _ ^C (<r+D 2 -l < ^C 
9b (,7+1)2 - n ((7+1)2 <■ n . 

The ability to innovate is taken on by firms who thus reduce their cost and produce 
more than previously. This is welfare increasing on counts of allocation ( greater con- 
sumption) and efficiency (lower unit cost for the industry). Firms however are trapped 
in a prisoner's dilemma since payoffs are greater for both when innovation is not an 
option. 

The problem faced by firms is that each has an incentive to be the first to innovate. 
Yet they end up investing too much i.e., more than the return they get from it. 

Price competition 

To assess Bertrand competition, we use the Hotelling model of duopoly price competition 
to avoid the difficulties associated with the Bertrand paradox. As shown in footnote 26.3, 
when marginal cost differ, the equilibrium price charged by a firm is p* A = t + 2CA 3 +Cg = 
t + c- 2xa + Xb with sales q A = \ + ^p- and profit n A = 2tq\. The total effect of a cost 

reduction is then 

ditA dqA 12 ,., _ ^ s 

— i =4tq A -^ = 4tq A - = -q A (12.9) 
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which represents a 33% decrease over the direct one. The extension of the Hotelling 
model to the circular city oligopoly (cf. §11.1.3 ) allows to perform a similar analysis. In 
§26.1.5 of the appendix, weshow that investment incentives are reduced by 20% for n=3 
and to no more than 15% in the large oligopoly limit (cf. for the complete analysis). 

We can now solve for the equilibrium of the R&D investment. Assuming that the 
same cost of investment y/{x) = Ax 2 for both firms, the best reply solves \q A = 2Xx A . In a 
symmetrical equilibrium x A = xb = x must hold, thus q A = qB = \ from which we deduce 



x* = gj and the final price p = t + c-x* . Profit is then n = tl2. In this simple model, 
profits are not impacted by the R&D efforts but since consumer enjoy a lower price, their 
surplus increases and so does welfare. 

Timing of Innovation 

A simple setting enables to draw interesting conclusions with respect to the timing of the 
decision to perform R&D. Imagine that each firm can engage into R&D at a fixed cost F; 
the probability of making a discovery is a and the market value of the discovery is V . 

The expected payoff for a monopoly is ni = aV - F and it is rational to undertake 
R&D if F < aV. Now, if there are two firms that can possibly engage into R&D the 
technological uncertainty is supplemented by a market uncertainty since the other firm 
may also discover the innovation in which case the market prize (cake) V would have 



to be shared equally. c The expected payoff for one firm is thus n 2 = a{\ - a)V + \V - F 



2 



so that R&D is undertaken by both only if F < (l - |) aV. For an intermediate R&D cost, 
one firm can successfully engage into R&D but not the other one. The innovation game 
between the two firms is one of coordination with a unique equilibrium up to the identity 
of the firm who does R&D. 

From a welfare point of view, we may consider maximizing the sum of profits. In that 
case we have to compare W\ = aV-F the welfare if a single firm does R&D (we neglect 
consumer surplus) and W 2 = 2n 2 the welfare when both firms do R&D. The cut-off is 
W 2 < Wi o F < {l-a)aV. We may thus conclude that individual decisions are efficient 
except for (1 - a) aV < F < (l - | ) aV i.e., the low cost of R&D push both firms into R&D 
while it would be optimal to have a single one doing it. 

12.2.5 Public Subsidies f 

As we already mentioned public subsidies for private research are frequent and involve 
large amounts. The US, supporting Boeing (B), and the EC, supporting Airbus (A), came 
to an agreement in 1992 according to which governments can back up loans up to one 
third of the development cost of a new aircraft. In October 2002, the US government 
accused the EU member countries to violate this agreement by lending to Airbus on 
better terms than the company could get from a commercial bank. The EC responded 
that Boeing cross subsidizes civil airplane development with military contracts. This 
bitter dispute after reaching a mutual menace of WTO litigation was settled late 2004 
by an agreement to dismantle the existing subsidies in the near future. 

The underlying reality of hidden subsidies can be understood using the model of Bran- 
der and Spencer (1983) who show quite clearly the distortion of competition that this 



subsidizing behavior can create. As we saw in §12.2.4 on innovation and market rivalry, 
firms competing in Cournot fashion have incentives to invest into R&D and reach an 
equilibrium level of R&D (cf eq. (12.8)) characterized by own technology parameters a, 
and Uj. The equilibrium profits net of R&D cost can be simplified into 

n* = liiixuxj) = -— ^-Lf-L (12.10) 

The symmetric situation where Uj = Oi = a is the most natural to start with; it occurs 
when firm finance R&D by leveraging funds on the world market at the same conditions. 
In that case, the common equilibrium level of R&D is x = £^k ■ 

It can be checked that the equilibrium level of R&D is greater than the level min- 
imizing cost because improving one's technology forces the competitor to reduce pro- 
duction and therefore adds an additional benefit. In that sense, the authors speak of 
over-investment in R&D. Yet, the truly efficient level of R&D has to maximize welfare 
net of the R&D investment. More precisely, welfare here is that generated in a compet- 
itive market where the good is sold at marginal cost; since marginal cost is constant, 
profits are nil thus the welfare reduces to consumer surplus computed at the marginal 
cost. Using the formula seen in §2.2.2, the consumer surplus at price a is Woic) = ~ 2fc Ci) . 
Hence, the welfare net of R&D cost is 

1 2 

W(x) = — (a- be + bx) 2 - -(a + 2)bx 2 ; 
2b 9 

hence the optimal value is x* = fferf] > 2x (check this using cross product), the amount 
expanded by the industry in the Nash equilibrium. We can conclude that the competition 
leads firms to invest sub-optimally into R&D. 

We now introduce governmental subsidies by assuming that each firm comes from a 
different country that is solely interested by the profits its national champion can gather 
on the world market and bring back home. To subsidize R&D, a government can offer 
a tax rebate that is to say, profits invested in R&D become deductible from income tax. 
In our model, this policy amounts to reduce the cost factor down to oi < a. Now observe 
that fti oc A = / g ' +2)g ' 2 (cf. eq. (12.10)), so that j& oc M- = -2 ? - " 7 " 7 '* 1 < 0. This derivative 
means that government i has an incentive to subsidize the R&D of its champion by 
decreasing the cr ; factor; in response to this fiscal incentive, firm i will increase its R&D 
Xi (check in eq. (12.8)), become a tougher opponent in the world market and earn more 
profit. 

The optimal support a* is difficult to characterize because we only known the benefits 
of subsidization while we ignore the cost of the public funds that are lost due to the 



subsidy (cf. §17.1.2). Nevertheless, government j, being himself rational, will react by 
subsidizing its own champion so that the overall levels of R&D will increase. 

Consider two firms pondering an investment into R&D aiming at reducing their pro- 
duction cost and become better contestants in the ensuing market competition. The first 
result of their model is that market competition "a la Cournot" leads firms to invest 
sub-optimally into R&D. 

Governmental subsidies are then introduced by assuming that each firm comes from 
a different country that is solely interested by the profits its national champion can 
gather on the world market and bring back home. To subsidize R&D, each government 
can free from income tax, profits invested back in R&D. The second result is that each 
government has an incentive to subsidize his champion although the optimal level is 
difficult to characterize because we only known the benefits of subsidization while we 
ignore the cost of the public funds that are lost due to the subsidy. Unless this govern- 
mental race overshoots the efficient level alluded before, the final amount of R&D is still 
insufficiently low which is in accordance with the findings of most empirical studies. Our 
last conclusion will therefore be that we should not bother for the EU-US dispute since 
these hidden subsidies bring us better planes and improve our travel experience. 

12.3 Intellectual Property Rights 

Technological innovation in the form of a less costly production technology or a new prod- 
uct, is at the root of all material progress in human societies. It is the random outcome 
of a very costly investment into R&D. When successful, the innovation bestows an ad- 
vantage in quality or cost upon its creator who becomes a better competitor, frequently a 
monopoly; the innovator is thus able to obtain extraordinary profits that enable him to 
recoup his initial R&D investment. As we saw in the chapter on R&D, innovations that 
are easily copied must be protected by legal means to maintain the incentive to invest 
into R&D. 

This protection issue was long recognized by governments of industrialized countries. 
To encourage innovation they issue patents that protect creators over the domestic mar- 
ket and enable them to earn monopoly rents for some period in order to recoup the cost 
of their R&D investment. The remedy has proved successful since some industries like 
pharmacy do not hesitate in spending as much as a quarter of there income into R&D. 

More broadly we are interested in this chapter by Intellectual Property Rights (IPRs) 
such as patents and trademarks; they give their developer certain exclusive rights over 
the exploitation of their work and Know-How. 26@ An example will help to grasp the very 
idea of intellectual property. To access the Nokia discussion forum on mobile phones, one 



must become a member and accept the terms among which the "License to Nokia for Any 
Submitted Content" stating (in a single sentence): 

By submitting any information or materials such as feedback, data, text, software, music, sound, photographs, 
graphics, video, messages, answers, questions, comments, suggestions, scores, hints, strategies, concepts, 
designs, ideas, plans, orders, requests or the like, or any other material (Content) to Nokia, you license and 
grant Nokia and its affiliates and sub-licensees a non-exclusive, royalty-free and free of charge, perpetual, 
worldwide, irrevocable, and fully sub-licensable right to use, reproduce, modify, adapt, communicate to the 
public, make available, publish, translate, copy, modify, adapt, create derivative works of, distribute, and dis- 
play such Content or any concept described in it throughout the world in any media, product and/service, 
including, without limitation wireless devices, mobile phones and any related products, services and acces- 
sories, advertising, marketing and promotional materials, and digital reproductions, without compensation, 
restrictions on use, acknowledgement of source, accountability or liability, and with waiver of all moral rights 
and rights of attribution, integrity and identity. 



12.3.1 Patent 

A patent is a set of exclusive rights granted by a government to an inventor for a man- 
made, technical creation with the purpose of encouraging the development of new inven- 
tions and their disclosure. 27@ Patents for new drugs are probably the most famous ones 
although it is not always the molecule that is patented as in the case of aspirin where 
the patent was awarded to Bayer for the industrial process of synthesizing the drug. 

The first 28@ patent law is the "Inventor Bylaws" awarded by the Venetian Republic in 
1474. England introduced its "Monopoly law" in 1624 and other industrialized countries 
followed in the next century. Today a patent confers for 20 years 29 ® the right to exclude 
others from making, using, selling, offering for sale, or importing the patented invention. 
As an alternative to legally monopolizing the market, the patentee can license the patent 
to other firms; in any case, the investment made during the development of the invention 
can be recouped. Three conditions must be fullfilled to obtain a patent. 

• Novelty: nobody else must have invented it before; the patent office checks this by 
performing a search of "prior art". 

• Non-obviousness: the invention must not be an obvious answer to a problem for 
someone who is skilled in that technological field. 

• Utility: the invention must be useful, have a practical application. 

After the expiration of the 20 years monopoly period, anyone is free to practice the 
invention and since disclosure is mandatory to obtain the patent, this is very easy to do. 
Thus, the patent system can be seen from two perspectives, either as a way to reward in- 
dividualistic innovators or a way to promote the diffusion of innovative knowledge in the 
whole society. The patent system is nevertheless the object of much debate. Regarding 



the protection provided by a patent, Arundel (2001) shows empirically that secrecy and 
lead-time (being first to market a new idea) are viewed by inventors, especially small 
firms, as a better protection from imitation. 

The information technologies are generating a problem around novelty and scope of 
patents. Owners of some old patents claim they cover some of the most widely adopted 
practices of the internet like hyperlinking, 30@ graphical formats, 31 ® or software code as 
in the most influential case where SCO Group, inheritor of some elements of the Unix 
operating system, sued IBM in 2003 claiming that IBM's Linux misappropriated the 
Unix code belonging to SC0. 32@ More recently, the US patent office has awarded patents 
for the translation of old business methods to the internet like shopping with a virtual 
cart, 33@ buying with a virtual credit card, 34@ or recommending items. 35 ® 

Jaffe and Lerner (2004) argue from their econometric analysis that the intensifica- 
tion of lawsuits is due to regulatory reforms from the 1980s. They converted the US 
patent system from a stimulator of innovation to a creator of litigation and uncertainty 
that threatens the innovation process itself. Indeed, firms fear to introduce products 
using well known devices or computer code because someone has succeed to patent it 
and is threatening to sue those who would not ask for a licence, which is likely to be 
expensive. As a result, development and production costs are increased. In a reaction to 
the SCO-IBM case, large software makers like Microsoft, Sun, Kodak or Novell recently 
announced they will cover the cost of litigation whenever one of their customers is sued 
because some products he bought contain copyright, patent, trademark or trade secret 
infringement. 

The EU is better protected against these misuses because the European Patent Office 
issues a public notice of information before granting the patent; this way, other parties 
can offer information relevant to the determination of novelty. The proposed new direc- 
tive on Patents seek to avoid any drift towards patents for business methods or computer 
programs which do not provide any technical contribution to the state of the art. The US 
system lacks this feature and is furthermore financed by the fees of applicants so that 
frivolous patents are frequently issued in a clear case of regulatory capture. The US also 
use a "first to invent" rule contrary to the "first to file" rule used in the rest of the world; 
this generates a lot of legal wrangling about priority. 

12.3.2 Copyright 

Copyright is the right to copy and publish (in a broad sense) a particular literary or 
artistic original work. It also covers copying in electronic form, the making of translated 
versions, the creation of a television program based on the work, writing computer pro- 



grams, and putting the work on the Internet. The copyright protects only the expression 
of an idea, so it is legal to take the plot from somebody else's mystery novel, and write 
your own mystery novel based on that plot. Most countries, including the EU and US, 
have adhered to the Berne Convention 36 ® granting automatic copyright protection for 
the artist's lifetime plus 70 years upon creation of the work. In some countries, regis- 
tration with a Copyright Office has some benefits like being able to sue, or to get bigger 
damages if you win. 

Copyright laws let authors decide how, where and when to exploit the works they 
created. Usually, works are sold in the form of copies on tangible media, such as books 
or compact discs. The author of a work can request a fee for every copy made of his 
work. He can also restrict the number of copies being made, for example to keep the 
work exclusive. 

Copyright protection is much longer than the patent one and generally involves a 
smaller if not a zero investment; it is supposed to be an economic incentive for the next 
producer of art, not a guarantee for the established one. A prime element of explanation 
for this difference with patents is the greater moral value put upon intellectual creations 
in comparison with technical ones, the second one is the necessity to avoid the congestion 
that would result for the diffusion of the popular works if they were free of royalties. 37 ® 

The copyright industries involve press, publishing, music, broadcasting, movies, the- 
ater, advertising and computer software; they contributed 5.3 % of the 15 EU GDP and 
3.1 % of total EU employment in 2000. Similar figures obtain for the US economy. More 
information is available on the EC internal market website. 

12.3.3 Trademarks 

A trademark is an exclusive right giving its holder the right to exclude (or stop) others 
from using the mark. It is mainly aimed at preventing unfair competition; it helps to 
identify good and services and to create a good image (in legal terms goodwill). Many 
counterfeit copies of trademarked goods enter the market in the hopes of confusing po- 
tential buyers. Because such counterfeit copies are typically of inferior quality with 
respect to the original product, they can seriously harm the value of the trademark and 
the image created. This explains why many trademark holders are very active in prose- 
cuting copy cats and destroying counterfeit goods. 

When a trademarked product lawfully enters the market (e.g. because the trademark 
holder manufactured it and sold it in a store), the buyer may want to resell it, either on 
the same market, or at some entirely different market in a different country or region. 
As selling a trademarked product is use of the trademark in commerce, the trademark 



holder could block this resale by claiming trademark infringement. As this is not always 
fair, various countries have developed an "exhaustion" or "first sale" doctrine regulating 
when a trademark holder can and cannot act against a reseller of 'his' products. 

The universality principle states that if the trademarked product legally entered the 
market somewhere, reselling the product anywhere in the world is not infringement. 
However, it ignores countries legal differences and was never applied for historical rea- 
sons (protectionism). The converse approach, known as the territoriality principle states 
that a trademark is legal only in the country where it is registered. 

In the USA, the first sale doctrine states that the trademark holder has full control 
until the first authorized sale within the USA. He cannot act against domestic resellers 
of products he put on the market himself or if the product was manufactured by a third 
party with his permission (a licensee). To sell Levi's jeans in New York, one can buy them 
anywhere in the US but not in Brazil unless the Brazilian trademark "Levi's" is held by 
the same company as the US trademark. Still the US trademark law can be used to bar 
this parallel import on the ground that the foreign product would be considered different 
from the domestic product by the American consumer (different quality). 

In the European Union, the Trademark directive supports a similar idea called the 
exhaustion principle: when trademarked products are put on the market in a member 
country with the trademark owner's consent, he cannot later oppose the import or re- 
import of these products into another country in the EU Furthermore the European 
directive do not permit member countries to have more liberal exhaustion laws, for ex- 
ample by allowing worldwide exhaustion. 

The Trademark directive has been refined by the ECJ in the Silhouette judgement of 
1998. Contrary to the US view, to be able to sell products bought outside the EEA into 
the EEA, the importer must have the unequivocal consent of the trademark owner. So, 
even when a trademark holder sells a product in Brazil without any restrictions, he can 
later block parallel import of the product at the European borders. But if he put them on 
the market in Belgium, he can't block parallel import to the Netherlands, nor a re-import 
from the Netherlands back to Belgium. 

12.3.4 IPR Cases 

Trade Secrets Litigation is an alternative way to seek profits since a court victory can 
entail a considerable market advantage. It is customary to see firms mutually threaten- 
ing themselves with legal actions. 

The Barbie vs. Bratz series of lawsuits from 2004 to 2010 is a case in point. Under 
most legal systems, all creation made by an employee within a firm belongs to that firm. 



Hence, when a person changes employer within the same industry, there are frequently 
lawsuits initiated by the former employer against the new employer relative to the illegal 
transferring of intellectual property by the defector. 

Mattel, the world's largest toy maker, has seen sales of its famous Barbie dolls suffer 
from competition by the edgier, urban-styled Bratz line which MGA launched in 2001. 38@ 
In April 2004, Mattel sued the doll designer for secretly working with MGA while still 
employed by them (back in October 2000); the latter countersued contending that his 
employment agreements with Mattel were invalid and the Bratz remained just an idea 
until after he left Mattel. In April 2005, MGA increased the stakes and sued Mattel, 
claiming its My Scene toy line was unfairly similar to Bratz and that Mattel was us- 
ing its clout with retailers to stifle competition. In November 2006, Mattel retaliated 
by accusing MGA of copyright infringement, misappropriation of trade secrets and rack- 
eteering, demanding in short full ownership of the Bratz doll. In October 2007, MGA 
hired a prestigious (and expensive) legal counsel. In July 2008, the designer settled 
with Mattel for an undisclosed amount. A week later, a federal jury ruled against MGA 
and awarded .lbn$ in damages to Mattel (out of 2bn$ requested). 39@ The crux of matter 
then was whether MGA owed Mattel profits derived from the first original dolls or from 
all the subsequent dolls and related products. In December, the judge ordered MGA to 
recall all Bratz dolls from retailers, to destroy "specialized plates, molds and matrices" 
used to make the dolls and to transfer all trademark rights in the Bratz name to Mattel. 
MGA appealed the ruling and in the meantime introduced an alternative line of dolls. 
In January 2009, the judge granted a stay of execution allowing MGA to operate during 
that year. In February, MGA changed stance and tried to settle the case with Mattel 
by selling them the Bratz line for less than .5bn$, but to no avail. In April, the judge 
ordered to hand over the control of the company to a temporary receiver only to overturn 
the decision in May while fastening the transfer of the Bratz ownership to Mattel. MGA 
appealed and in December, the court suspended the transfer order (i.e., MGA could keep 
selling its Bratz dolls) and finally reversed it in July 2010 stating: "It is not equitable 
to transfer this billion-dollar brand — the value of which is overwhelmingly the result 
of MGAs legitimate efforts — because it may have started with two misappropriated 
names". The entire case will probably be have to be retried. In this story, the money 
spend into building offensive and counter-offensive lawsuits neither improves the qual- 
ity of products nor positively stimulates demand. The two firms are simply engaged into 
a war of attrition with a view to either exclude the challenger from the market or get a 
bigger share of the cake (cf §7.4.2). 



DMCA The Digital Millennium Copyright Act passed in 1998 in the US aims at up- 
dating intellectual property rules for digital content. This extreme law has been used 
by private parties to exert control over their markets in a way never intended by its 
creators. 

In 2003, a Kentucky court granted a preliminary injunction to Lexmark International 
against a company that makes generic replacement cartridges for Lexmark printers. The 
court found that a chip in Lexmark cartridges that identify the refills as "official" could 
be protected under the DMCA, and thus, cannot be cloned. This is an obvious intent by 
a company to limit the access of its clients to less expensive ink cartridges (cf cases of 
illegal tying 24.1). 

Publishing results that show the weaknesses in the security of a software package 
can violate the DMCA. The most famous case is that of russian programmer Sklyarov 
who was arrested in California and jailed for 3 weeks in July 2001 after giving a speech 
about his company's software that could bypass protections on Adobe Systems' eBooks. 
Prosecutors argued the russian software violated the DMCA, which outlaws offering soft- 
ware that can circumvent copyright protections. The company faced charges related to 
directly designing and marketing software that could be used to crack eBook copyright 
protections, plus an additional charge related to conspiring to do so. The defense said the 
software was designed to allow people to make backup copies of eBooks they already own 
or transfer the material to a different computer. In december 2002, the jury acquitted 
Sklyarov's company of all charges. 

Trademark Levi-Strauss has been confirmed into its right to be the only seller in 
Europe of Levis jeans trough its networks of exclusive Levis shops. The British super- 
market chain Tesco was condemned by the european court of justice (ECJ) for buying 
genuine Levis jeans in the US and selling them in its stores. 40 ® 

One may point that the prohibition of parallel imports is not in accordance with the 
raison d'etre of trademark law, which is to assure the originality of goods and to protect 
consumers from piracy and fraud. Furthermore, the current EU legislation favors anti- 
competitive behavior by providing Trademark holders with the possibility to segment 
global markets and apply price discrimination. In the face of the argument that inter- 
national exhaustion would reduce the economic value of trademark rights and damage 
innovation, one should note that all the cases judged in this respect by the ECJ pertain 
to high end products whose reputation was long established so that investments were 
covered a long time ago (e.g., Levi Strauss, Davidoff, Sebago, Christian Dior). 

A few years ago, Microsoft filled a trademark lawsuit against Lindows, a company 
offering a low-cost Linux-based operating system that was compatible with popular Mi- 



crosoft file formats. Microsoft claimed that "windows" was clearly a trademark while 
Lindows argued that "windows" is a generic term for a certain type of software interface 
that predates the Microsoft product. MS successfully barred Lindows in a number of 
countries and finally came to an agreement with its potential infringer in 2004. 

For the record, Apple unsuccessfully sued Microsoft in 1992 for copying the look and 
feel of its Macintosh desktop software. In its successful defense, Microsoft argued that 
windows, icons and menus were developed in 1970s at Xerox corporation! 

Copying of Books and Writings Scientific publishers MIT Press, Elsevier and John 
Wiley have opened a lawsuit against a shop selling photocopies of their copyright pro- 
tected books to students at the University of Los Angeles. The case was settled with 
the shopkeeper agreeing to pay damages and honor all of the plaintiffs' copyrights in the 
future. 

A musician was once sued by the company holding his own copyrights for writing 
a new song too similar to an older song written by himself twenty years before (cf 
Columbia Law School list of recording artists copyright infringements). 

In still another case where real names do not matter, company A purchased a single 
subscription of a Market Analysis newsletter from company B but used to disseminate 
it to all its employees by photocopying, faxing and even posting an electronic version on 
its intranet. The content producer was awarded $20 million in damages. Likewise, the 
American Geophysical Union sued the Oil company Texaco in 1992 for illegal photocopy- 
ing; the case was settled before the start of the trial with the defendant paying a large 
(undisclosed) amount to the right holder (more examples). 

DVD Protection The media format known as Digital Versatile Disc (DVD) is the suc- 
cessor of the Compact Disc (CD). Initially, Sony and Philips advocated a disc with a 
single side of engraved data while a group of seven (mostly Japanese) firms advocated 
for a double-sided disc. Recalling the bitter competition in the 1970s between the home 
video recording format VHS and Beta (cf. §24.3.5) , the two sides reached consensus on 
hardware and software specifications and created the DVD Forum in 1995 to license the 
new standard. 

It has been customary for hollywood studios to stagger movie releases across conti- 
nents, first the US, then Europe and Japan a few months later and finally the rest of 
the world some months later; it was important for studios that DVD would hit markets 
in the same staggered way. Hollywood therefore required and obtained from the DVD 
Forum that each DVD player would be given a code for the region in which it would be 
sold. 41@ This meant that a disc bought in one country would not play on a player bought 



in another country. Such a feature could be viewed as an illegal vertical restraint of 
trade enabling unwarranted price discrimination, but since most DVD-players are now 
shipped with a hack removing the region coding, the issue has become irrelevant. 

DVD Copying A highly mediatized case of copyright infringement related to the DVD 
technology is DeCSS. The Content Scrambling System (CSS) is an encryption and au- 
thentication scheme intended to prevent the perfect digital copy of DVD content (collo- 
quially known as "ripping"). Although there is no charge to obtain a CSS license, it is a 
lengthy process. In 1999, only computers operating under the Windows or Mac systems 
could offer a DVD-player thus, in order to play a DVD on a computer equipped with the 
Linux operating system, a group of computer addicts "reverse engineered" the CSS code 
to produce a code breaker named DeCSS which was immediatly posted on hundreds of 
websites. 

The Motion Picture Association of America (MPAA) quickly reacted in court by invok- 
ing infringement of the DMCA but has failed to defend effectively its monopoly of DVD 
decryption. In 2003, Jon Johansen who posted DeCSS on the internet was acquitted by 
an Oslo court of the charges of theft. He admitted copying only legally purchased DVDs 
using the program, and the court ruled that he was entitled to do this. 

More generally, the decompilation of computer programs to achieve inter-operability 
has been ruled lawful in partial contradiction with the DMCA provision that forbids 
decompilation of programs protecting digital versions of copyrighted works; examples of 
such protection algorithm are present inside the WMA format from Microsoft and the 
AAC format from Apple to sell music digitally. 

DivX & MP3 As a standalone application DeCSS is useless for piracy but combined 
with a "DivX" encoder it enables to burn on a single CD a good quality copy of a DVD 
movie. This brings us to the second infringement, that of the patents for compressing 
music and video. The original DivX software published in 1999 combined the reverse 
engineering of Microsoft's proprietary compression algorithms for sound and video. To 
avoid legal problems, developers quickly turned to the MPEG-4 ISO specifications (cf 
§24 on standards) in order to develop trouble-free video compression algorithms. 

As for audio, the most popular format over the internet is the MP3. This lossy com- 
pression algorithm was developed in the late 1980s by the German Fraunhofer Institute 
using funding from the EC and became the audio part of the MPEG-1 ISO standard (cf. 
§24 on standards). The idea is to use models of human auditive perception to eliminate 
much of the redundant information stored in a CD so as to obtain the same listening 
experience but using twelve to twenty time less space (this technological constraint was 



quite relevant back in the 1980s). 42@ 

The reason why MP3 has become the digital audio standard is probably due to the 
initial lax enforcement policy of the original patent holders, Thomson Multimedia and 
Fraunhofer IIS. From 1992, date of the ISO release, until 1998, software developers 
have been able to explore the code to develop players (to listen MP3s on the computer) 
and encoders (to rip CDs into MP3s). By contrast, several other digital audio formats 
developed by companies like Lucent, Yamaha, and Microsoft, have kept their formats 
proprietary and limited how outside developers can employ their technology. Releasing 
the technology in the open has probably helped this standard to become dominant thanks 
to the user-friendly softwares that have been developed around it. Starting late 1998, 
the patent holders have been seeking licensing from commercial developers. 

Internet File Sharing Many recent cases of IPR infringements derive from the afore- 
mentioned "ripping" and encoding technologies which permit the sharing of digital ver- 
sions of songs and movies on peer-to-peer (P2P) distribution networks. With the ever 
higher penetration of high speed internet connection in households (e.g., cable, ADSL), 
the sharing of movies in digital format has increased steadily. 

Starting in December 1999, the Recording Industry Association of America (RIAA) 
battled for 18 months the Napster network which permitted the free sharing of copy- 
righted music in the MP3 format. The courts ruled in this case that the software devel- 
oper was directly involved in the copyright infringement because its server was perform- 
ing the searching and indexing of available files on the user's computers. Developers re- 
sponded with decentralized Peer-to-peer networks. In response, the RIAA and the MPAA 
started to sue individual users for direct infringement of the DMCA. A second strategy 
adopted by record labels and movie studios has been to sue developers for indirect in- 
fringement of the DMCA. Both the lower and the appeal courts ruled that peer-to-peer 
software developers were not liable for any copyright infringement committed by people 
using their products, as long as they had no direct ability to stop the acts. The case has 
gone since to the Supreme Court who ruled in June 2005 that file-sharing software com- 
panies could be held legally responsible for copyright infringement on their networks. 
Since then, many countries have enacted laws going in the same direction. 

Defenders of these P2P networks are quick to point at the similarity with the case 
of VCR (cf §24.3.5): Sony, a manufacturer of VCRs, was accused in 1976 by content 
maker Universal Studios of permitting the copying of copyrighted material. 43 ® The final 
judgement of the US supreme court in 1984 confirmed that "the non commercial home 
recording of material broadcast over the public airwaves was a fair use of copyrighted 
works and did not constitute copyright infringement". The underlying argument is that 



a distributor cannot be held liable for users' infringement so long as the tool (VCR or soft- 
ware) is capable of substantial noninfringing uses (cf review). Several famous professors 
of economics denounce the diversion of this judgment into the simplistic conclusion that 
one should never use indirect liability for products merely capable of substantial non- 
infringing use (fair use). They argue on the contrary that indirect liability is the optimal 
mechanism when direct deterrence is ineffective because of the high costs associated 
with identifying and pursuing individual violators. As in the case of online auctioning 
of prohibited items, the auctioneer is not held responsible as long as he acts expedi- 
tiously to remove infringing content as soon as he becomes aware of its existence. On the 
other side, Intel, the giant semiconductor manufacturer claims that expanding the scope 
of indirect liability would chill innovation and stifle the development of new products, 
including some designed to enhance lawful access to copyrighted works! 
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Chapter 13 



Firms vs. Markets 



This Part focuses on firms and their relations to markets; it also contains, for historical 
reasons, the theory of contracts that is of broader scope. 

Industrial Organization traditionally limited its scope to the interaction among firms 
in the marketplace. The interactions taking place inside firms were investigated by 
academics from management, sociology and organization theory. Yet, as shown on Table 
13.1, a large share of value creation (GDP) in modern economies is generated within 
firms, not on markets (cf also Lafontaine and Slade (2007)). 1@ For manufacturing, the 
ratio is one third, for services, it is twice that and on average, over one half. These 
stylized facts prove that firms are a worthwhile object of study for economists. 



US 


1998 


2003 


2008 


Agriculture 


40 


43 


43 


Mining 


55 


57 


52 


Utilities 


61 


58 


58 


Construction 


47 


50 


50 


Durable goods 


36 


36 


36 


Perishable goods 


33 


33 


28 


Information 


52 


52 


51 


Wholesale trade 


72 


70 


65 


Retail trade 


71 


71 


67 


Transportation 


52 


52 


48 


Finance 


55 


56 


55 


Real estate 


71 


70 


69 


Business services 


64 


67 


66 


Leisure 


56 


57 


56 


Education, Health 


61 


61 


61 


Government 


65 


62 


60 


Total 


55 


56 


54 



France 


1999 


2004 


2009 


Agriculture 


49 


46 


38 


Food, Tobacco 


24 


25 


21 


Energy 


33 


29 


25 


Construction 


44 


43 


44 


Consumer goods 


33 


30 


28 


Automotive 


19 


18 


16 


Equipment goods 


29 


28 


26 


Intermediate goods 


33 


30 


29 


Trade 


55 


53 


51 


Transport 


48 


47 


50 


Finance 


51 


49 


46 


Real estate 


78 


80 


81 


Business Services 


57 


55 


53 


Personal Services 


54 


53 


53 


Education, health, social 


79 


78 


77 


Government 


71 


70 


71 


Total 


51 


50 


50 



Table 13.1: Value Added as % of Production 



The first section centers on the firm and builds on the fact that productive activities 
are regulated only to a limited extent by prices and markets and much more by author- 
ity. We investigate the explicit and implicit incentives schemes at work inside firms to 
promote hard work and cooperation. The second section then contemplates various the- 
ories regarding the formation and evolution of the firm, including the "black box" one 
dominant in this book according to which the firm is a single rational minded economic 
agent called the "entrepreneur" interacting with other entrepreneurs on markets. 

13.1 Inside the Firm 

In this section, we briefly recall the legal nature of the firm and identify the basic in- 
centive problem residing inside the modern corporation, the fact that executives and 
workers' objectives need not be aligned with those of the owners. We then show how ex- 
plicit contracts like wage schemes enable to realign incentives. When opportunism, free 
riding or moral hazard cannot be eliminated using enforceable contracts, some implicit 
contractual arrangements are nevertheless available. Holmstrom and Tirole (1989) and 
Simon (1991) cite authority, rewards and identification (loyalty), external control and 
bureaucracy. 

13.1.1 What is a Firm ? 

Legal Forms 

A small business is called a (sole) proprietorship if owned and managed by a single indi- 
vidual while it is called & partnership if several people unite to own and manage the firm. 
Setting up a partnership is the most sensible and economical way to start a business; yet 
the owners are personally liable without limits for their firm's debts or commitments. 

When owners start to recruit managers to run the business on their behalf, they often 
turn their firm into a corporation, a legal entity able to borrow or lend money, sue or be 
sued and pay taxes. 2@ At the creation of a corporation, its shares are often held by the 
company's creators and a few backers (venture capitalists or family). If the firm grows 
successfully, it will need additional capital. This leads the owners to list their shares in a 
stock exchange where new shares are easily issued, quoted and traded. The corporation 
becomes public (not to be confused with public ownership by the state) and owned by 
dispersed stockholders (or shareholders) who range from the small individual investor 
to the gigantic pension fund. However, partnerships remain prominent in the areas of 
law, accounting, medicine, investment banking, architecture, advertising, and consulting 



because it is difficult for clients to assess the quality or ability of the people serving them 
(cf §21.1.2). 

The stockholders own the corporation but do not manage it; their active participa- 
tion is limited, in European law, to elect 3 ® an administrative board (directors) who then 
appoints top management and make sure that managers act in the shareholders' best 
interests (a not so clear concept to be elucidated later). This separation of ownership and 
management contributes to stabilize corporations both ways: if there's a change of man- 
agers (they quit or are dismissed), the corporation can continue its business and develop 
according to the desires of the board; likewise if a stockholder sell his shares to a new 
investor this will not disrupt the operations of the corporation. 

The legal definition of the corporation also encompasses limited liability of the own- 
ers which means that shareholders will lose at most the price they paid for their shares. 
The other side of the coin is that shareholders are only entitled to the firm's residual 
cash flow i.e., what remains after employees, suppliers, lenders and the government 
have been paid. The corporation has also some disadvantages when compared to a part- 
nership because there are bureaucratic costs linked to the necessary communication in 
the authority chain illustrated on Figure 13.1. There are also conflicts of interest in 
this chain which generate inefficiency costs. Formal contracts being unavailable inside 
the firm, some alternative methods are designed to cope with opportunism, free riding, 
moral hazard and hold-up; they are presented hereafter. 

Authority — »- — ■- — ■- 

shareholder manager supervisor employee 
Communication -« — — — -« — 

Figure 13.1: Communication and Authority Ladder 



13.1.2 Explicit Incentives: Compensation 

A craftsman has always the right incentives towards hard work because he is the resid- 
ual claimant of his efforts. Motivating his apprentice toward the maximization of the 
shop's profit is also quite elementary using a bonus for good performance or the promise 
to inherit the workshop later on. 4@ Transposing this result to the more general and ubiq- 
uitous modern corporation where employees almost invariably work in teams is more 
difficult. We comment here some issues that are developed analytically in §20.2 of the 
chapter on incentives. 



Team vs Individual Compensation 

Paying an employee according to a performance measure reflecting his individual con- 
tribution is the cheapest way to provide him with incentives towards effort. Also, if the 
observed signal is precise, the risk exposure of the agent is low so that incentives are 
cheaply provided (cf formal model of §20.2). Whenever agents work in teams, the per- 
formance measure necessarily includes contributions from other people which mechan- 
ically increase the risk exposure of everyone; incentives thus become more expensive to 
provide so that team compensation would look inadequate. 

However, joint accountability enriches the performance measures available to the 
manager, which helps to mitigate multi-task problems. To understand this issue, re- 
call that the technological advantage afforded by specialization also means that agents 
perform their specialized activity in different places or over different products. The ex- 
ception is line assembly where the worker applies his specialized effort repetitively and 
can thus be paid at a piece rate to guarantee optimal effort. A scientist or manager, how- 
ever, will typically work over at least two projects simultaneously and the productivities 
of these tasks may not be perfectly reflected in the observed performance measures. 

Put differently, the profit of a product line is the sum of profit attributable to, say, 
R&D and marketing; yet revenues, an observable measure, may magnify the marketing 
activity with respect to the R&D one. This means that if an employee works only in 
that line of business and his wage is linked to revenues and tuned so as to give him 
adequate incentive to expand effort in R&D, he will mechanically expand too much effort 
in marketing. Now, if the agent performs the same task, say marketing, in different 
product lines, his wage can depend on the two revenues and can be tuned so as to give 
him the right incentives to perform the efficient time allocation among the two tasks. 
However, he will be exposed to a greater risk since his wage will become more variable. 

In §20.2.4, we develop Corts (2007)'s model exposing this arbitrage between risk and 
multi-tasking. We conclude that when the degree of risk aversion is low, or when signals 
of effort are informative or when the multi-task problem is serious, team work dominates 
individual accountability. 

Relative vs. Joint Performance Evaluation 

Having shown that team work can be the optimal organization inside the firm, we can 
delve further into this idea. We seek to uncover optimal performance measures and to 
assess whether an employee's compensation should be tied to the performance of his 
peers. 

One option is to link positively the wage to the group's performance with joint perfor- 



mance evaluation (JPE) i.e., set a bonus linked to the division's profit, the output rate 
of the assembly line or the group-based lending adopted in the Grameen bank (micro- 
credit). The other option, relative performance evaluation (RPE), takes an opposite point 
of view and recommends to penalize the employee (relative to others) if the entire group 
fare well. This is a form of contest as studied in §7. For instance CEOs are more likely 
to be dismissed from their jobs after bad industry and bad market performance indicat- 
ing that their peers performed better The "Tour de France" cycling competition displays 
both kind of evaluations. Every day, the first runner passing the finish line gets a time 
reward, the second a smaller one and so on; this is a case of RPE. On the "team" day, the 
time of each runner is the time of the latest from the team to pass the finish line; this is 
a case of extreme JPE where teammates take great care of the weakest among them. 

The economic theory of incentives (aka. moral hazard) largely focuses on short- 
term relationships between employees and employers; it advocates the use of incentive 
schemes that are sensitive to individual performance measures and induce competition 
among workers for instance via tournaments for access to a higher position (cf §13.1.3). 
Moral hazard in teams extends the previous focus to the employee's peer group. If the 
performance measure of workers has a common noise component, then RPE can be at- 
tractive since it insulates the workers from the risk of common shocks and thus gener- 
ates a stronger incentive than other schemes. This allows the employer to provide incen- 
tives at low cost. Yet RPE, by entailing extreme competition among team members, fails 
to foster cooperation. In environments where workers interact closely with each other it 
leads to an inefficient peer pressure: the group forces a "rate buster" to reduce his output 
so as not to make other team members look bad. In many activities team work has been 
implemented to foster cooperation among employees via frequent and long-term interac- 
tion, via empowerment (decentralized authority) and via peer monitoring and no outside 
supervision. Incentives to effort are given through a JPE wage scheme. 

The model of interaction between employers and employees developed in §7.4.3 comes 
to the following conclusion. 

For short term relationships, it is optimal to use RPE i.e., to pay someone when his 
signal is good but when the partner's signal is bad. The key to this result is that a 
good signal is more informative of effort if the partner's signal is bad. Thus, paying 
a bonus solely in that asymmetrical situation generates an inexpensive motivation to 
work. 

Long term relationships call on to use the opposite JPE scheme because it is important 
to avoid retaliation by workers who could be penalized by the RPE scheme. To guar- 
antee effort during the whole duration of the relationship, a mutual bonus is optimal. 



13.1.3 Implicit Incentives: Motivation 

Motivation refers to all the incentive schemes that go beyond short run piece rate; it 
includes authority, rewards, identification with the firm, external controls and special- 
ization. 

Authority 

Two features of employment relationships are key to understand its widespread use. 
Take the example of a shirt manufacturer and its dressmaker. Firstly, there is uncer- 
tainty for the employer: shall he produce white or black shirts? Fashion will tell! Sec- 
ondly, the employee is likely to be obedient because he is indifferent about the color of 
the fabric to be sewed. Those elements makes the employment contract very attractive 
for both parties because it saves on transaction costs, there is no need to negotiate a con- 
tract for the furniture of white shirts and another one for black shirts or similarly there 
is no need to go back to the local trading post each time the manufacturer needs to buy 
a new bundle of shirt in the latest fashionable color. 

But the authority does not only issue commandments; more often it asks a result to 
be produced ("repair this sewing machine"), or a principle to be applied ("sew body before 
sleeves"), or goal to be achieved ("sew as cheaply as possible consistent with quality"). 
There is delegation not only of the task but also of the method used to perform the 
task. Doing the job well is not so much responding to commands but rather taking 
initiative to advance organizational objectives. To be sure, obeying rules literally is a 
favorite method of work slowdown, an alternative to strike used by all sorts of controllers 
(customs, fiscal). 

The employer's ideal command would be "always decide in such a way as to maximize 
company profit" but checking its obedience is almost impossible without losing the ben- 
efit of delegation. Even if the employees were (loyal) robots, complete discretion would 
lead to chaos and would surrender the large efficiencies attainable by specialization in 
decision-making work. We need to delegate within guidelines, which creates the prob- 
lem of monitoring the observance of guidelines without recentralization what has just 
been delegated. Indeed, to ascertain and judge the decision criteria of the employee, the 
employer would have to review the whole decision process thus completely losing the 
benefit of delegation. The solution is then to motivate the employee in order to be able 
to trust him. 



Rewards 

Employees may be motivated to accept authority and contribute to the organization's 
goal by giving them monetary rewards, promotions, or perquisites. The "workaholic" 
behavior of many managers is well explained by their career concerns and the implied 
lifetime income stream (cf §20.2.5). However reward schemes work only if contributions 
are measurable; it is then clear why wages are linked to sales, piecework or the firm's 
profit. Yet if a key characteristic like quality of the product is not observed, the incentive 
scheme may cause quantities to grow at the cost of lowered quality. Likewise safety rules 
may be ignored or problems passed to other divisions within the firm. 

It seems obvious that rewards systems should be aligned with desired behavior but 
management theorists tell us that this is not common practice; instead there is an 
overemphasis on "objective" criterions, quantifiable standards, highly visible behaviors 
and also hypocrisy by the rewarder claiming that the target was not met although it was. 
Some examples, inspired by Kerr (1975), are: 

• Economy and prudence should rule governmental budgeting but the biggest budgets 

are allotted to the biggest spenders. 

• Voters punish candidates who frankly discuss where money will come from; thus they 

vote for acceptable but void goals. 

• Physicians tend to over medicate patients (label a well person sick), because the re- 

verse error (labeling a sick person well) has far severe consequences although both 
kind of errors should be minimized. 

• Academics ought to manage the dual responsibilities of teaching and researching but 

since in most countries they are rewarded for one task only, they tend to neglect 
the other one. 

• In sports, it is team performance that matters but individuals who obtain rewards 

such as fame and endorsement contracts. 

An enduring system of motivation is found in yardstick competition and tournaments 
whereby a group of employees are benchmarked one against the others. Promotion is 
an extreme case wherein a large prize is awarded to the best performer. It has proved 
to be an efficient system of motivation and to make the prize of promotion even more 
attractive to the many contenders, many firms pay their top executive more than her 
marginal productivity. The threat of dismissal can also discipline workers in a firm but 
only if they face a possible unemployment period or a lower wage in the new job. Hence 
firms pay experienced people much more than youngsters to increase the difference in 
expected salary (efficiency wage theory). 



Since rewards tend to turn employees into competitors, the cooperative attitude so 
important for the firm's success might be lost; it would therefore be dangerous to rely 
solely on rewards as a source of motivation. 

Identification 

We do not observe much free riding or shirking inside firms; instead many employees 
display pride in work and loyalty to their organization. Identification with the struc- 
ture is a widespread behavior that induces employees to accept organizational goals and 
authority as bases for their actions. 

Natural selection can explain why such a behavior is common. Darwinian models 
and experiments support neither the idea that nice guys can make it to the next gener- 
ation nor for the idea that people only pursue selfish personal economic goals. Instead, 
the observation that humans depend on the surrounding society for nutrition, shelter, 
safety and apprenticeship has lead theorists to predicts the appearance of docility (be- 
ing tractable, manageable, and teachable) as a key to building fitness and improving 
survival. Darwinian fitness calls for a substantial responsiveness to social influence: 
motivation to learn or imitate, willingness to obey or conform; hence Simon concludes 
that docility is not altruism but enlightened selfishness. 5@ This topic is also discussed in 
§24.5. 

Docility is used to inculcate individuals with organizational pride and loyalty. Iden- 
tification with the "we", which may be a family, a company, a nation, or a sports team, 
allows individuals to experience satisfaction from successes of the unit they belong to. 
Thus, organizational identification becomes a motivation for employees to work actively 
for organizational goals. Furthermore our bounded rationality acts as a reinforcing feed- 
back; indeed we cannot grasp the complex reality in its entirety and therefore focus our 
attention on specific aspects among which the goals of our organization. 

External Control 

As stated above, the well functioning of a necessarily decentralized organization rests on 
the active participation of subordinates to make constructive propositions and take value 
enhancing decisions. When instead, they interfere to promote their own well being or 
their career, they generate so-called influence costs. 6@ Beyond the arguments of internal 
discipline seen above, several external forces make managers "behave" , several of which 
appear as models in §23.3. 

• Labor market discipline states that misbehavior almost always end-up noticed, thus 
made public which ruins the manager's reputation and his ability to find another 



job. 

• For investments, managers may have a shorter or longer horizon (reputation) than the 

firm; they may end up taking sub optimal decisions. Stock-options are then a good 
proxy of an option on the firm's value which is also a good proxy of an option on the 
manager's reputation. 

• Product market discipline argues that whenever there is competition it is possible to 

judge a manager's performance against that of competitors, hence one can write 
incentive contracts. This comparability also enhances the reputation concern of 
the manager (cf. §3.2.2). 

• Capital market discipline tells us that incumbent managers must do their best to max- 

imize the firm's value because otherwise they will be ousted by a hostile take-overs 
and replaced by better able and willing managers. 

Bureaucracy 

Weber (1922) characterizes bureaucracy as the centralization of administration into a 
corps of well trained professionals relying on the division of labour within the organiza- 
tion, a pyramidal authority structure and impersonal rules that regulate the relations 
between organizational members. 

The key of a bureaucracy is its system of control based on rational rules meant to 
design and regulate the whole organization on the basis of technical knowledge in order 
to achieve maximum efficiency. The specialization of tasks is achieved by an extensive 
definition of the duties and responsibilities of each individual. Hierarchical relations 
are impersonal and authority is legitimized by the efficiency of the administrative rules, 
unlike feudal order where relationships are personal and based on the sacredness of tra- 
dition. Hence loyalty is oriented towards an office and not towards who holds it. The 
bureaucrat is not selected on basis of family or political loyalty but on formal qualifi- 
cations related to his specialized duty; 7@ also the position cannot be sold or pass to 
an heir. 8@ Entering the bureaucratic organization is for a "life's work" i.e., to make a 
career. Remuneration ignores short-term productivity by relying on a fixed salary but 
stimulates long-term productivity through promotion for achievement; it also fosters fi- 
delity by accounting for seniority. 

The popular view of bureaucracy is often that of "red tape" and inefficiency. To explain 
this perversion we have to notice that if bureaucracy implies reliability and predictabil- 
ity it also lacks flexibility and tend to turn means into ends. Indeed, the emphasis on 
conformity and rigid rules induces the bureaucrat to internalize them; the formal aspect 
of bureaucracy becomes more important than the substantive one, the efficient working 



of the organization. 

On the theory front, Niskanen (1968) shows that a bureau tends to overproduce wrt. 
the efficient situation where marginal cost is equated to social WTP (cf. §17.2.4). Since 
public services have near natural monopoly attributes (cf. proof in §17.2), delegation to 
the private sector (or privatization) would create the de-facto monopoly whose tendency 
is opposite i.e., lower output so as to charge a higher price. This typical "Chicago school" 
author, is nevertheless aware that privatization is no solution unless an effective regula- 
tion forces the awardee to behave in the people's interest. The question then is whether 
a (of necessity public) control agency is better at regulating a bureau or a concessionaire 
(cf. §16.4.1). 

More recently, Acemoglu et al. (2008) explain why pensions, health care and educa- 
tion are managed by the State. The social aim of schooling is to increase human capital 
by adequate teaching but privatizing schools introduces yardstick competition. First, in- 
stitutions wastefully advertise to attract customers. Secondly, they are judged on char- 
acteristics observable in the short-run meanwhile the social objective is a long run one. 
Hence, schools substitute socially valuable investment by visibility enhancing ones. On 
the human capital side, teachers and school managers train children like monkeys to 
succeed at state exams. Law enforcement displays the same problem with over-powered 
incentives under yardstick competition. Regarding pensions, a private fund manager 
will distort the composition to display short-run profitability when the long-run is what 
matters. Regarding health, private hospitals tend to substitute medical equipment by 
amenities to attract more clientele instead of focusing on curing them. Even within state 
agencies, as soon as bureaucrats are rated per output, they start to go after easy preys 
(may be innocent people) instead of socially harmful ones. For instance, IRS agents pur- 
sue divorcees who failed to pay the pension to their ex-wife instead of trying to catch 
big-time defrauders. All of these cases point toward low powered incentives characteris- 
tic of the bureaucracy in order to sustain a good result (though clearly second-best). 

13.1.4 Internal Organization of the Firm 

Functions vs. Products 

During the first half of the XX th century, firms took advantage of scale economies by orga- 
nizing themselves around functions or processes i.e., they had a different departments for 
marketing, manufacturing, purchases, finance, engineering, R&D, legal matters, human 
resources and so on. Chandler (1992) identifies market leadership and the building of 
a competitive advantage as the result of a three-pronged strategy: investment in large- 
scale production to lower unit cost, investment in marketing, distribution, and purchas- 



ing networks and finally, recruitment and organization of professional managers. There- 
after, many firms became conglomerates producing a range of vastly different products. 
Under such an organization, most decisions are taken at the company's headquarters, 
ensuring better coordination of activities across departments but also leading to conges- 
tion at the top. This system emphasizes the importance of inputs and takes advantage of 
the specialization of labor; at the same time the identification of contributions to overall 
profits is made difficult which can result in demotivation, lack of incentives but above all 
confuse the allocation of cash-flow towards the best opportunities. 

The opposite organizational form focuses on the outputs or products i.e., emphasize 
consumer vs. business clients, US vs. European geography, Civil vs. Military use, agri- 
cultural vs. industrial specificities. This classification leads the firm to group comple- 
mentary tasks into self-contained divisions. 9 ® Thanks to this decentralization, the better 
informed local managers are taking better decision in the day-to-day operations but also 
wrt. long-term strategy. Under this latter organization, the contributions of divisions 
are easier to identify thanks to analytical accounting and therefore easier to reward. Be- 
ing quasi-autonomous profit centers, divisions compete for the firm's cash-flow in what 
Williamson (1975) called a "miniature capital market" with potential gains in efficiency 
over external markets deriving from manager's superior knowledge of the firm's oppor- 
tunities compared to that of investors or bankers. 

The tension between the two organization forms can be illustrated as follows: when 
the VW automaker bought Seat and Skoda in the 1990s in addition to its original brands 
Volkswagen and Audi, it intended to run the 4 firms as independent competitors each 
with a distinct image. The group then quickly realized that important scale economies 
could be achieved by designing platforms upon which models of the 4 brands could be 
build; roughly speaking some elements of the cost of designing a new car would cut 
by a factor 4. That was a move back towards a functional organization. This integra- 
tion process was successful and heightened at about 60% of the components of a car. 
The downside was too much similarity of the models that came to compete one against 
the other instead of competing against outside brands. The current strategy of the VW 
group seems to be a strengthening of the brand image of each subsidiary, thereby re- 
ducing significantly the commonalities inside the group (a move back to the divisional 
organization). 10 ® 

Authority vs. Information 

Another fundamental characteristic of a firm is its distribution of authority in relation to 
the information structure, that is to say, who is to take the coordination decisions when 
the activities of the firm are hit by a shock ? Maskin et al. (2000) and Qian et al. (2006) 



study this trade-off in a simple model where the product is successfully completed (sold) 
only if all complementary parts fit together (e.g., car, computer, congress organization, 
software or an audit). Although operations follow a plan, there are always unexpected 
(exogenous) contingencies that create either a mismatching called an attribute shock or 
a line failure called a capacity shock. Examples of the former kind would be when a piece 
does not fit or a new kind of piece is needed while examples of the latter kind would be 
when a piece does not come in time or the product does not sell anymore. These shocks 
call for adjustments and require coordination given the complexity of the whole process. 
It is at this point that the organization of authority and information circulation can 
make a difference. There are only two authority layers, one CEO and two division man- 
agers. We then assume that division managers have perfect information with respect 
to shocks hitting their division but none with regard to other divisions while the CEO 
receives a more or less distorted information of all divisions. The authority structures 
are thus : 

• Decentralization : the division managers take coordination decisions. 

• Centralization : the CEO takes all decisions in which case the firm shape is irrele- 
vant. 



The concepts of internal organization can be represented schematically. Imagine an 
entertainment firm selling two items, videos and music with two processes, production 
and sales. The two organizations are represented on Figure 13.2. 
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Figure 13.2: Two Organization Schemes 



In the simplified scheme of Figure 13.2, the decentralized product focused organiza- 
tion (M-form) is more efficient if there are only attribute shocks (mismatching) because 
in the decentralized U-form firm, no adjustment can be carried out for the failure to com- 
municate and because the centralized form suffers from an imperfect communication. If 



there are only capacity shocks then the decentralized U-form firm dominates the M-form 
one as soon as there are some storage costs. Otherwise the M-form organization can 
anticipate the effect of any shock with the following policy "transfer 50% of the expected 
need of the other division if i am not hit by a shock myself". When no shock occurs trans- 
fers cancel out and if only one division suffers a shock it is insured by the automatic 
transfer of the other so that production are equalized in the two divisions. When both 
type of shocks can occur centralization is likely to become the best organization form if 
the communication channels are sufficiently efficient. 

Summarizing, a process focused firm copes well with capacity shocks because she is 
good at managing substitution or reallocation of resources from a product line to another 
when needed. A product focused firm conversely copes well with attribute shocks since 
all processes are affected by the shock and the close integration of those processes within 
the division allows their managers to coordinate on an effective reaction. To conclude, 
the internal organization of a firm will evolve in reaction to the evolution of consumer 
tastes, technological changes such as the information processing enabled by the use of 
computers and networks or the better specialization of tasks enabled by robotization. 

13.2 Contract Theory 

In this sub-section, we explain why real life contracts are so incomplete. The reasons 
we advance borrow from the transaction cost and property rights theories of the firm 
studied in the next section. An primer on the topic is Salanie (2005), while Bolton and 
Dewatripont (2005) is a more exhaustive reference. 

13.2.1 Incomplete Contracts 

Contracts are useful to make commitments, to delegate task, to allocate decision and 
control rights; their ultimate goal is to allow for more efficient exchanges (in a wide 
sense). As the examples displayed below demonstrate, contracts are often incomplete in 
the sense that much is left to be decided later on. 

Market Whether at the city market or at a shop, the buyer picks an item, pays the listed 
price (or bargain a little over it) and walks away with the item. The exchange con- 
tract is complete in the sense that nothing we may think of is capable of derailing 
that simple interaction. 

Haircut The posted price is agreed by the customer and the beautician, but the for- 
mer gets to specify "how she wants it to look," and the latter gets to decide on the 
instruments and how they are used. 



Restaurant For the listed price on the menu, the contract gives you the right to receive 
one serving of the dish and eat it at your table. It is tacitly understood that you can 
decide how fast you eat, how loud you talk, and how big a mess you leave behind, 
while the restaurant can decide on how and from which ingredients the dish is 
prepared, how soon it is served, when you get your drinks, which napkins you are 
given, who sits at the other tables, and so on. 

University The contract explicitly gives the buyer the right to attend classes and use 
libraries for an academic year; he has also discretion regarding timing and selection 
of the classes to attend. In turn, the university can decide which courses are offered, 
when and where they are offered, and who teaches them. It also decides on the 
contents of the libraries, the temperature of the pool, and so on. 

Custom-built house The contract includes a large set of architectural drawings spec- 
ifying particular production methods and materials. However, a tremendous num- 
ber of decisions are still left out. The customer can make decisions on colors, fix- 
tures, and a number of other decorative items. Conversely, the builder can decide 
on the brands or suppliers of several materials, as well as the exact location of 
many nails, joints, and boards. 

Let us then try to classify what should be put into a contract and why less is in fact 
used. Firstly, it is difficult or costly to fully specify the contracting parties' obligations 
because 

• Parties must speak the same (business) language. 

• Writing a contract is time consuming. 

• Future contingencies are difficult to describe, thus to incorporate. 

• Contingencies must be observable by both parties for otherwise they'll disagree 
regarding what happened. 

• Contingencies must be verifiable by an external enforcer such as a judge or a private 
arbitrator for otherwise they can't be enforced. 

• Wealth constraints and the protection afforded to parties by limited liability put 
limits on the use of money to elicit the parties' willingness to pay for specific services 
or decisions. 

• Collusion: complex contracts use parties to check on each other's truth-telling and 
obedience; there is then an incentive for parties to side-step the contract and collude 
against its enforcer to avoid penalties. 

• Enforcement being provided by human beings, the latter may be lazy or fail to 
understand the contract or let their own preferences (or legal precedents and prin- 



ciples) bias their decisions or even collude with the parties against the spirit of the 
contract. 

• Ex-ante asymmetric information among the signatories forbids them to include ev- 
erything that might be relevant in the contract since the better informed party 
prefers to maintain his informational advantage. 

• Renegotiation: cf below 

For these reasons, contracts are incomplete and only specify general rules of behavior 
or decisions and control rights such as who is to decide what to do if some specific event 
occurs or who is to decide on how to use an asset in the future. 11@ As noted by Tirole 
(1999), one must understand that simple institutions such as authority, property rights, 
and patents are popular because they have good robustness and learning properties i.e., 
they are not too far of the mark when the parties make mistakes in their view of the 
world or become less rational and they are universal so that one can learn how to use 
them in a context and apply them in a different environment. 12 ® 

The major problem generated by incompleteness is renegotiation. As events unfold 
and new information becomes available, decisions must be taken to achieve the objectives 
of the relationship; in other words, the parties need to constantly update and adapt their 
plans. There is little doubt that the parties will be able to identify the efficient course of 
action, that which maximizes the value of their relationship. By acting efficiently instead 
of following a default behavior (that thought to be correct ex-ante when the contract was 
designed), they generate a quasi-rent (cf. §2.4.3 and Fig. 2.9). The problematic issue is 
its division among the parties. 

In the TCE vision, parties cannot help but haggle over the quasi-rent, each trying 
to get his way. In the end some of that wealth is wasted i.e., there is a deadweight- 
loss. Integration of the parties under a common hood can then alleviate this problem by 
replacing negotiation with authority, assuming that the intensity of haggling is reduced 
when parties are hierarchically related. 

In the PRT vision, the Coase theorem holds (cf. §2.4.3) which means that some mid- 
dle ground is immediately found and there is no deadweight-loss at the outset of the 
bargaining over the quasi-rent. The problem then is "hold-up", the fact that if one party 
has to share the fruits of her efforts with the other, then she has less motives to invest 
into meliorating the relationship (cf. §14.1.3 and §14.2). 

13.2.2 Simple Contracts 

Real life contracts are remarkably simple in that they frequently omit potentially useful 
and feasible provisions; they willingly ignore verifiable information or avoid to specify 



specific behaviors. In particular, a lot of decisions made during the life of a contract are 
absent from it and instead tacitly delegated to one of the parties. For instance, employ- 
ment or service contracts essentially focus only schedule, duration and compensation. 
At most, a vague notion of the mission to accomplish is stated. Thereafter, the principal 
orders the agents a task for which the latter has considerable latitude to execute (cf also 
the examples above). 

Wernerfelt (2007) explains that attempts to economize on bargaining costs imply that 
two parties may write a contract which is incomplete in the sense that each party tacitly 
cedes some decision rights to the other. Recall indeed that if future decisions are jointly 
taken, parties will reach an efficient decision but will also have to bargain over the 
division of the quasi-rent which is wasteful (under the TCE vision). 13@ On the other 
hand, if one party is delegated the decision right then no haggling takes place but she 
will fall short of efficiency for two reasons. Firstly, she will lack the input of the other 
party and secondly, she will sway the decision towards her personal best. Beyond the 
obvious role of the bargaining cost which calls for delegation, a party is more likely 
to cede the decision-right over an attribute if she does not care much for it, has less 
information about it or has little difference of opinion about it. Furthermore, inso far 
as the decision taken is not irreversible, the excluded party can call for a renegotiation 
when she feels that the candidate decision is too far from her ideal. This threat will force 
the decision maker to behave in the first place and thus yield a better outcome, closer to 
the jointly optimal one but without any actual costly negotiation. 

Kessler and Leider (2010) offer a behavioral explanation 14@ to contractual simplic- 
ity based on norms: a simple "handshake" over a norm of good behavior ("do the right 
thing") is enough to motivate the parties to abide by the spirit of the contract and provide 
a "consummate performance" when the most that an enforceable contract could achieve 
would be a "perfunctory performance". 15@ The underlying driver is the fact that people 
suffer for violating the norm they choose to follow. Thus, negotiating a high norm results 
in parties take high intensity actions close to the ideal level required for the success 
of their relationship. 16@ In games with strategic complementarity, the norm sensitivity 
must be high to compensate the natural proclivity to undercut the partner's action. It 
is also shown empirically that establishing a high norm is an effective substitute for an 
enforceable restriction. This is so because an enforceable restriction allows parties to 
justify living down to the letter of the contract, rather than fulfilling its spirit. Parties 
thus see the inclusion of a minimum requirement as implicit permission to take only 
that action, a notion known as "crowding out" of intrinsic motivation. 17@ Given this sub- 
stitution effect, the optimal contract is extreme i.e., either trivial and norm-oriented or 
meticulous and clause-oriented. 18@ 



13.2.3 Flexible Contracts 

The alternative, following Hart and Moore (2008), is to consider not simple vs. complex 
but flexible vs. rigid. By definition, more is known to the parties ex-post than ex-ante, 
so that more can be contracted upon. Yet, many issues remain unverifiable i.e., although 
parties observe what's going on there is no way to assert it in front of an expert. Each 
party is thus left free to perform as she sees fit. The span of moral hazard or free will 
is even larger if the contract deliberately delegates some contractible actions to a party. 
To assess how parties perform those out-of-contract actions, we take a behavioralist ap- 
proach. If there is trust, each party will abide by the spirit of the contract and perform 
in a consummate manner. 19@ If, however, one party feels "shortchanged" or aggrieved, 
she will retaliate by sticking to the letter of the contract and giving a perfunctory per- 
formance, thereby creating a deadweight-loss (wrt. optimal performance). For instance, 
a seller can cut quality, delay delivery, exploit ambiguous terms, contest facts, refuse to 
cooperate or make a last minute change or threaten to do any of these things. They key 
is that the retaliatory action is cheap for him but expensive for the buyer. Apart from 
quality, the buyer can behave in a likewise fashion. 

How can mistrust appear ? At the ex-ante initial contracting stage, parties trust 
themselves and see the contract as balanced and just. 20@ Indeed, being negotiated un- 
der competitive conditions, the contract constitutes a salient reference point because 
market forces define what each party brings to the relationship. Once the parties are 
immersed in their bilateral relationship, social norms of behavior and performance lose 
some of their grip and the self-serving bias kicks in. 21@ Each now tends to claim for 
herself any wealth newly created, especially the quasi-rents that appear during the life 
of the contract. Whatever the sharing rule in use (either exogenously fixed ex-ante or 
endogenously bargained ex-post), at least one party will feel aggrieved and will generate 
a deadweight-loss by its revengeful attitude. 22@ 

One way to put an end this inefficiency is to reduce quasi-rents by rigidifying the ex- 
ante contract so as to eliminate most if not all ex-post adaptations which are the source 
of quasi-rents. The downside is obviously the inefficiency of such a contract unable to 
adapt to changing circumstances. This trade-off then helps to understand why parties 
tend to put restrictions (disallow renegotiation) on conflicting variables such as price 
or compensation but leave more room for more consensual ones such as the nature or 
characteristics of the good to be traded or the task to be executed (e.g., I care about my 
salary but not so much whether I teach micro or macro or whether I teach in the morning 
or the afternoon). 



13.3 Theories of the Firm 

13.3.1 Organizations vs. Markets 

The Puzzle of Organizations and Markets 

Before reviewing the most important theories relative to the formation of the firm, Si- 
mon (1991)'s tale will help set the analysis framework: A martian approaches the Earth 
from space, equipped with a telescope that reveals social structures. Firms (and pub- 
lic institutions) reveal themselves, as green areas with faint interior contours marking 
out divisions and departments. Market transactions show as red lines connecting firms, 
forming a network in the spaces between them. Within firms the approaching visitor 
also sees blue lines of authority connecting bosses with workers. Our visitor might see 
one of the green masses divide, as a firm divested itself of one of its divisions or it might 
see one green object gobble up another (acquisition or merger). 

No matter whether our visitor approaches North America, urban China or Europe, 
the greater part of the space below him would be within the green areas, for almost all 
of the inhabitants would be employees, hence inside the firm boundaries. Organizations 
would be the dominant feature of the landscape; a rough description would speak of large 
green areas interconnected by red lines instead of a network of red lines connecting green 
spots. Hence our visitor would be quite surprised to hear the structure called a market 
economy instead of the obviously more appropriate organizational economy. 2 ^® 

The constant progress of technology since the start of the industrial revolution has 
enabled firms to achieve great economies of scale and scope; their size has increased in 
relation to markets which is why most of the population work as employees or managers 
but few are entrepreneurs. 

Economics of Organization 

As rightly argued by Gibbons (2005), theories of the firm ought to be called "theories of 
the boundary of the firm". In this section, we pit the neoclassical view of a firm as be- 
ing defined by production constraints against the more recent proposals of institutional 
economics, namely the property rights theory (PRT) and the transaction cost economics 
(TCE). None of these views are incompatible, rather, they differ in scope as one can no- 
tice with Table 13.2 borrowed from Williamson (2000). Four levels of organization are 
distinguished; for each, we indicate the time frame of its evolution, the objectives (not 
necessarily explicit) it tries to achieve, the nature of outcomes and lastly the theory fit- 
ting best the case. 



Level 


Foundation 


Institution 


Governance 


Transaction 


Time 


century 


decade 


year 


continuous 


Objectives 


religious beliefs 


social rules 


private rules 


rationality 


Output 


norms 
customs 


law 
bureaucracy 


contracts 
organization 


price 
quantity 


Theory 


sociology 


property rights 


transaction costs 


neoclassical 



Table 13.2: Social Organization and Time 



The foundation level has no clear origin, it seems to appear spontaneously, being 
driven mostly by religion; it is always taken as fixed by economists (except historians). 24 ® 
According to North (1990), institutions are the humanly devised constraints that struc- 
ture human interaction. They are made up of formal constraints such as rules, laws 
or constitutions and informal constraints such as norms of behavior, conventions and 
self imposed codes of conduct (descendant from the foundation level). Together they de- 
fine the rules of the game i.e., the incentive structure of societies. 25 ® A classification of 
institutions pertinent for our purpose is: 

• Political institutions, including the constitution, the legislative and judicial branches, 
form the superstructure above the government. 

• The government determines economic policies on the short-term and designs eco- 
nomic and political institutions for the long term. 

• Economic policies such as wealth redistribution (taxation, subsidization) or public 
investment are decisions implemented using of economic institutions. 

• Economic institutions cover the areas of law regarding contract, commerce, tort, 
patents, property and finance to name a few. 

The fundamental property of institutions is that they outlived the people who are 
part of them; they bring stability and credibility to the rules they establish for all actors 
whether the government or firms. Regarding the organization of economic activity, the 
proponents of the property rights theory (PRT) claim that in the absence of a well de- 
signed legal system, chaos dominates. The purpose of defining and enforcing property 
rights is to eliminate warlike competition for the control of economic resources and re- 
place it with peaceful-non destructive-market competition. Private property rights are 
then the (human) right to use an asset, appropriate its returns, change its form, sub- 
stance, or location. The PRT is nevertheless demanding as it relies on a far reaching and 
complex law to decide in every possible circumstance who holds the property right and 
on a powerful State to enforce this right. It therefore assumes no transaction cost. 26@ 

It is however clear that technological progress and the advent of specialization have 
changed the structure of the economy; the relevant variables to be contracted upon are 



more and more complex so that property rights become more and more difficult to en- 
force. As a consequence, contracts are necessarily incomplete and parties are forced 
to adapt in the face of unforeseen events. In the course of renegotiating their original 
agreement, parties haggle over the value of what needs to be adjusted. There are cost of 
transaction because each party is at the same time an opportunist predator and a pray 
to his partner's opportunism. 

Transaction costs economics (TCE) then stresses the need to go beyond the descrip- 
tion of economic institutions and focus instead on the actual unfolding of the economic 
activity. Attention is shifted on the attributes of the transaction and the properties of 
alternative modes of governance, where the latter is understood as an effort to craft or- 
der i.e., to mitigate conflict among parties in order to achieve the mutual gains from 
exchange. The last level is the neoclassical one dealing with marginal analysis, the fine 
tuning of prices and explicit incentives, the fact that prices and output continuously 
respond to changing market conditions. We now proceed to review the three economic 
theories in the reverse sense. 

13.3.2 The Neoclassical Firm 

The premise of this theory is the technological frontier relating inputs and output studied 
in §2.1.2. It is known as the "black box" since its internal operation fall outside the realm 
of economics. It assumes that both can be bought and sold on competitive (spot) markets 
so that price taking behavior is always optimal. Given that output is sold at a constant 
market price, the profit maximizing firm has the adequate incentives to minimize its 
costs by choosing the best combination of inputs, applying the correct managerial effort 
and so on. 27@ The result is the cost function with its variable and fixed parts. The optimal 
production equates price with marginal cost but takes care to cover fixed cost i.e., price 
has to be greater than average cost. 

The average cost curve is often pictured as U-shaped because as soon as the firm oper- 
ates it has to pay some fixed costs for plants and machines. Over some production range, 
the marginal cost is constant because duplication of output only requires duplication 
of inputs in a fixed proportion. Thus the average cost is decreasing. There is however 
a scale of production beyond which the duplication of inputs like preferred location or 
managerial talent becomes difficult, then marginal cost rise and so does average cost. 
This gives a simple explanation to the limited size of a firm. 

Large output levels enable firms to specialize employees in specific tasks, to apply 
more efficient processes and invest in cost reducing techniques. Typically, a plant with 
several production lines will not stop producing when a failure occurs because the inputs 



destined to the failing line can be reassigned to other lines. Likewise the supply of many 
(more or less independent) markets rather than a unique one guarantees the firm an 
almost constant demand in the short term (insurance from conjuncture shocks). Those 
arguments are captured in the neoclassical economic theory by the ideas of scale and 
scope economies (cf §2.1.5). 

Hart (1995) recalls that this theory has successfully linked production decisions with 
prices of inputs and outputs, explained the aggregate behavior of an industry or the 
strategic interactions when output markets are not competitive (oligopoly). Still this 
theory says nothing about the size of a firm or about its internal organization. True, a 
manager's talent is limited but hiring a second one should solve any duplication problem 
related to this specific input. In a nutshell, the neoclassical theory is a theory of plant 
size not of firm size. Indeed, the legal definition of the firm relates to the ownership of 
capital, thus among other thing plants. A single tycoon could own all plants on the globe, 
each one having an ideally chosen size. Alternatively, all plants could be independent 
firms. In fact any combination is compatible with the neoclassical theory and this is why 
it fails to explain how firms are born, grow and die. 

13.3.3 Transaction Costs Economics 

Transaction Modes: Make or Buy 

Starting with Coase (1937), economists started to argue that institutions such as firms 
or public agencies serve the purpose of facilitating exchange and can best be understood 
as optimal responses to contractual constraints rather than production constraints. 28 ® 
The original quandary regarding the nature of the firm is "make or buy": should an input 
be bought on the market or made inside the firm? Should a product be sold to a retailer 
or directly proposed to the end-user ? The options contemplated here are backward and 
forward integration. If the firm goes this way, it is bound to grow in size since it will 
need more assets to produce internally. 

The major difference between the two modes regards the relationship of the involved 
parties. External procurement through the market involves a business relation between 
a seller and a buyer who stand on equal foot. The market price paid for the item provides 
parties with incentives to behave adequately; they end up maximizing the joint value of 
their relationship. More generally, contractual clauses establishing payments contin- 
gent on some specific performances are incentive devices. Internal provision involves an 
authority relation between a boss who orders production and a subordinate who carries 
on the order. 29@ The integration of a productive asset therefore shifts the terms of the 
relationship from a price mode to a quantity mode. More generally we can classify the 



way a firm buys an input or sells its output: 

• Spot market: buyers and sellers meet anonymously to exchange standardized prod- 
ucts. 

• Bilateral relationship: parties sign a long term contract specifying in a detailed 
manner how they make and exchange a product. 

• Integration: one party of the previous relationship buys the other to switch the 
relation mode from price to quantity. 

The choice of one mode over other alternatives obviously responds to monetary con- 
siderations (i.e., cost vs. gain). Whenever, an activity takes place between independent 
firms, there are so called transaction costs while there are management costs if the ac- 
tivity takes place within the firm. A simple marginal substitution argument tells us 
that the limit to the size of a firm is reached when the management cost of marginally 
extending the firm's activities is equal to the transaction cost. 

At this point, it would seem that internal and external provision are fundamentally 
different. This is not so because all the transaction modes previously described are gov- 
erned by contracts, the market transaction being the simplest of all; it is only the items 
contracted upon that differ. Even so, there remain many similarities. For instance, a 
sales and labour contract both stipulate a quality, a quantity and a price. In the former 
case, quality refers to the description of the good to be produced while in the former case 
it refers to the job design. Quantity then can refers to output to be traded (sale) or input 
to be provided (labour). Lastly, the price can incorporate cash or in-kind transfers or 
perks. 30@ The difference between the internal and external modes will therefore lie in 
finer details regarding what is observable and verifiable with sufficient precision to be 
incorporated into a contract. 

Contract theory study how contracts are designed and what purposes they perform. 
A very active branch is agency theory (cf Part H) which summarizes the firm as a 
bilateral relation between a subordinate, the agent and a supervisor, the principal. Its 
premises are that effort is unpleasant or costly and difficult to observe. To guarantee 
that employees or contractors work hard, it is thus necessary to pay them in function of 
their output. The agency theory has elaborated on this basic story to explain the nature 
of incentive contracts that are observed within firms, between firms or between firms 
and governments. 

Yet this theory does not put a limit to the number of contract the owner of the firm 
can sign with its employees or commercial partners. Once again, it fails to pin down the 
boundary of the firm. Also, the fact that a buyer and a seller are linked by a contract has 
no implication on their legal relation; they may be independents firms or divisions of a 



single firm; hence the agency view is consistent with the existence of a unique huge firm 
in the world or infinitely many small ones. It tries to explain the working of relations 
inside firms or among firms but not the fundamental question of this chapter "why do 
firms exists ?" 

Transaction Costs 

In an attempt to answer this quandary, Williamson (1975, 1985) has developed the 
Transaction Costs Economics (TCE) out of two observations pervasive of bilateral con- 
tracting. Firstly, it is difficult or costly to fully specify the contracting parties' obligations 
as we explained in §13.2. Secondly, parties develop "relationship-specific assets" of the 
following kind: 

• Site specificity: parties commit immobile physical assets. 

• Physical and Human asset: parties invest in specialized physical assets or human 
capital. 

• Dedicated assets: parties build productive capacity for which there is insufficient 
demand absent their trading relationship. 

In the presence of relationship-specific assets, parties become "locked in" to one an- 
other over the course of their relationship because the next best use of their specific 
assets is low. Indeed, site specificity raises transportation costs, specialized assets have 
a high switching cost and dedicated assets have a low rate of return. The "lock-in" then 
generates ex-post "quasi-rents", because the value of trade within the relationship far 
exceeds the value of outside trading opportunities (cf §2.4.3 on bargaining and in par- 
ticular Figure 2.9). 

When both contractual incompleteness and assets specificity are present, renegotia- 
tion is bound to occur so that the contracting parties face ex-post opportunistic behavior: 
each may engage in inefficient rent-seeking in an attempt to "hold-up" the other party 
and obtain a larger share of the quasi-rents (cf. §16.3 for a detailed introduction to rent- 
seeking). Whenever the two independent firms are merged, it is the common boss of 
the two divisions who decides how to share the quasi-rent, thereby reducing the quib- 
bling of the formerly independent division managers. Our study of rent-seeking in §7 
(pl93 and p62) supports this view on two grounds. Firstly, asymmetrically empowered 
rent-seekers, such as the boss and a division manager, jointly spend less in wasteful 
rent-seeking activities than if they are symmetrically empowered (bargaining between 
two bosses). Indeed, since the boss has a comparative advantage, he can fire the man- 
ager, it is not worthwhile to seek rent with great effort for the manager, so that the boss 
can also reduce his own effort. Integration, in this respect, gives more power over assets 



to one party and takes away power from the other party. Secondly, the bargaining over 
quasi-rents by two independent parties often involves asymmetric information regard- 
ing rent-seeking abilities. In that case, there is a welfare loss because parties expand 
effort to seek rent. Integration then eliminates this waste. 

Even if the sharing of quasi-rents does not lead the parties into wasteful rent-seeking, 31( 
the incentives towards investments into specific assets are diluted because each is a par- 
tial claimant to the fruits of his efforts; under-investment then results. A merger then, 
has the potential to improve upon this situation since the common owner is residual 
claimant. This point is at the heart of the property right theory (PRT) developed in the 
next paragraph. Whinston (2003) argues also that the merger improves the coordina- 
tion of investment and trading but increases bureaucratic costs. A central prediction of 
the TCE is that greater absolute levels of quasi-rents increase the likelihood of vertical 
integration (when contracts are incomplete). We study several models of TCE in §14.4. 

The wider implication of TCE is that the choice of an optimal governance structure 
moves from "anonymous market transactions" to "administrative rules" as difficulties 
appear such as complexity of the product, information asymmetry, moral hazard, hold- 
up, absence of property rights. Each move in the chain of Table 13.3 entails more secu- 
rity, less incentives and more bureaucratic costs. 



spot market -» short term contract -» long term (incomplete) contracts 
— integration into private firm -» regulated firm -» bureaucracy 



Table 13.3: From Market to Bureaucracy 



Formalization 



To understand the impact of "make or buy" " onto the production technology, Nickerson 
and Vanden Bergh (1999) recur to the dichotomy between mass andjob-shop production. 
On the one hand, a job-shop utilizes flexible re-deployable tools albeit with a limited 
output capacity (per unit of capital equipment). Mass production, on the other hand, 
develops inflexible speedy machines that increase production capacity. If capital expen- 
ditures are identical, the job-shop has a smaller production capacity and more generic 
capital equipment than mass production. Increasing the asset specificity then amounts 
to switch towards a larger capacity and it is then possible to identify asset specificity 
with total output of the firm. 

Now, the "make or buy" choice boils down to comparing two traditional cost curves. 
From the discussion on transaction cost, we may assume that the fixed cost of using 
standardized market exchange is less than the fixed cost of setting up a hierarchical 
relationship. Conversely, the additional cost of adapting to an increased specificity is 



greater when organized in the market compared to when organized via hierarchy. But 
such a cost variation is that due to a larger capacity requirement which is usually known 
as the marginal cost. If the two cost curves cross, as on Figure 13.3, the market procure- 
ment is optimal for low levels of specificity (or low sales volume) while the hierarchical 
organization dominates for high levels of specificity (or high sales volume). Notice that 
the cost curve which is effectively observed has a kink and the property that when hier- 
archy is in use, marginal cost is low and fixed cost is high while the reverse holds when 
the market is used. Thus, comparisons of the two modes using descriptive statistics can 
suffer from sample selection bias . One mode should not be compared to the other, but 
to a counterfactual scenario for without this correction, it may falsely appear more effi- 
cient and cost effective than the other (cf discussion at the end of §14.4.1). Lastly, for 
situations where one curve is entirely above the other, there is an unconditional optimal 
organization of production. 




Hierarchy Q 



Figure 13.3: Optimal Internal Organization 



Empirical Findings 

There are few econometric studies on the TCE but they support it fairly well. Masten 
(1984) studies the procurement decisions of a large aerospace company over 1,887 com- 
ponents. The two variables having a positive effect on the likelihood of integration are 
the degree of component complexity which measures the difficulty of making complete 
contracts and the degree to which the component was specific to this firm. He also found 
that a combination of the two features reinforces the probability that the component 
would be made "in-house". 

Monteverde and Teece (1982) explore the level of internal versus external procure- 
ment for 133 components used by GM and Ford in 1976. Unlike Masten (1984) , they 
do not focus on physical asset specificity, but on the (un-patentable) know-how that is 
generated during the design development. In their words: "The existence of transaction- 
specific know-how and skills and the difficulties of skill transfer mean that it will be 
costly to switch to an alternative supplier. An assembler (GM or Ford) will tend to choose 



vertically integrated component production when high switching costs would otherwise 
lock the assembler into dependence upon a supplier and thereby expose the assembler to 
opportunistic re-contracting or to the loss of know-how". The key variables are the level 
of engineering effort to measure know-how acquisition and the specificity of the compo- 
nent (whether usable in other cars). These authors find evidence that increases in both 
variables raise the likelihood of integration. 

Joskow (1985) studies the coal procurement decisions of electric utilities and finds 
that vertical integration is more likely for "mine-mouth" electric generating plants, those 
which sit next to the source of their fuel. 

Hold-up 

The first allusion to the hold-up is Marshall (1890) (V.XI.33) who observes that partial 
monopolists, such as railway, gas, water and electrical companies, attempt to raise their 
charges on the consumer who has adapted his plant to make use of their services; he 
cites a Pittsburgh steel mill who after switching from coal to natural gas is faced with 
a sudden doubling of the price. Once the mill owner has sunk the cost of setting-up 
his machines, the cost of making steel includes only labour and energy so that it is 
still worthwhile to produce although the energy component is dearer; this is so because 
the capital component has been sunk i.e., does not enter into the equation anymore. 
This ex-post opportunism of the power utility amounts to hold-up the profits of the mill 
and can even reduce them so much that there is not left enough to repay the initial 
investment (sunk cost). 32@ Absent a binding contract or integration of the two firms, 
the utility cannot commit not to act opportunistically because this is just being rational. 
Reasoning backwards, the mill owner will invest less or not at all if he anticipates that 
this future hold-up will prevent him to recoup today's investment. 

Williamson (1971) and Goldberg (1976) reintroduce the topic in modern economics 
by comparing short term, long term contracts and the potentital benefits of vertical in- 
tegration. Klein et al. (1978) discussing the historically famous case of GM and Fisher 
Autobody, already mentioned by Coase (1937): Fisher made car bodies and sold them at 
a high price to GM who then asked Fisher to build a plant adjacent to the GM plants. 
Fisher refused and GM eventually bought Fisher out. The economic analysis goes as 
follows: suppose that after having signed a contract for delivery of car bodies, Fisher 
builds a plant nearby GM's ones. Once Fisher's plant has been set up, GM can threaten 
to break the relationship pretending to buy car bodies from another supplier; this would 
leave Fisher in a very difficult situation because he has specialized its assembly line to 
fit GM's models and can hardly sell those bodies to another car maker. Thanks to this 
ex-post opportunism, GM could force a renegotiation and obtain a better deal. The story 



goes saying that Fisher fearing not to be able to recoup his investment refused in the 
first place to make that investment (the plant located in Detroit) which in turn triggered 
the decision by GM to buy them altogether to make that desired investment. The topic 
is studied formally in §14.2. 

13.3.4 Property Rights Theory 

A critic 

Alchian and Demsetz (1972) developed the Property Rights approach in economics as a 
critical development of the TCE which, according to their view, fails to explain the deci- 
sion to integrate an asset. Grossman and Hart (1986) and Hart and Moore (1990) provide 
us with the first models of asset integration and argue that although transaction cost ar- 
guments explain well the switch from short term anonymous spot market relations to 
long term ones (repeated bilateral trade), they are silent with respect to the decision to 
buy the partner. Likewise, the problems of moral hazard or adverse selection identified 
by the agency theory are not improved by integration; it is difficult to sustain that more 
relevant information becomes observable after integration or is more easily used by par- 
ties to design better contracts. Whatever audit that the new integrated firm can perform 
to extract information and reduce agency costs could be mandated from outside if firms 
were not integrated. In other words, opportunism is likely to remain a plague within 
firms where decentralization and delegation is the rule. 

Ownership vs. Control 

Legally speaking, the ownership of an asset is associated with ownership of the revenues 
accruing from that asset. What the property right approach to the firm stresses is that 
ownership also gives power over the use of the assets. Grossman and Hart (1986) borrow 
from the theories of agency and transaction costs to build a contract theory oriented on 
the use of assets: 

• A firm is the list of assets it owns. 33@ 

• Ownership of an asset confers all rights over its use and the returns it generates. 

• The owner can cease a specific right like a specific use of the asset. 

• Transaction cost are large, contracts are incomplete, hence residual rights matter. 

• Residual rights give the power to decide on the use of the asset whenever something 

unexpected arises. 



Consider the example of a publisher Alan, a printer Bart and his press, the asset. A 
specific right is the use by Alan of Bart's press to print copies of a book. The contract 
among Alan and Bart can be fairly precise about the obligations of Alan regarding quality 
of the material to be printed or the obligations of Bart regarding quality of the printed 
material but it cannot possibly specify all details of the use of Bart's press in every 
conceivable situation, so that when something unexpected occurs it is Bart who decides 
what to do. For instance, it is not written in which language Alan's material must be 
submitted but the jurisprudence has stated that Alan has the residual right to choose 
any roman language while Bart can legally prohibit asian languages (that are far more 
costly to print). Now, if Alan buys Bart's press, he can decide to print in Chinese since 
he holds the residual control rights over Bart's asset (the press); the added cost will be 
borne by Alan. 

The decision for Alan to buy Bart's asset is really about the size of Alan's business; it 
is related to long term risk. By having the residual control rights over Bart's press, Alan 
makes sure that whatever happens in the future, he will be able to have the asset work 
for him if he needs it (to print rapidly more copies). He will not have to design a complex 
and costly contract to convince Bart to do what he wants. The potential negative side of 
integration is the distortion of the operator's incentives: Bart is likely to be less careful 
with the press once he has sold it to Alan. The property rights theory thus predicts that 
the owner of an asset should be the agent most able to use it; for instance my house or 
my car are my property because it maximizes my incentive to take care of them, hence 
to maximize their value of use. 

The Insurance example 

Grossman and Hart (1986) illustrates their property rights theory with the following 
analysis of the insurance industry classified into direct writers who own their clients 
database and indirect writers who don't. In the former case, the sellers of insurance 
might well be called employees since they are not able to sell insurance products from 
a different firm. In the latter case, the sellers are (independent) agents able to sell 
various brands of the same insurance product. From the point of view of the previous 
theory, there is a single asset, the client list, and the question simply revolves around its 
ownership (non-integration has no meaning). As we shall see now, the markets where 
one form dominates the other corresponds fairly well to the prediction of the theory. 

The investments undertaken by an insurer can be training of salesmen, advertis- 
ing, product development, policyholder services. A retailer, whether employee or broker, 
invest into building loyalty with its clients; it is a long lasting investment that can pro- 
duce long lasting premiums. Due to their non-verifiability, these investments cannot be 



contracted upon; instead insurers and retailers contract upon sales (initial ones and re- 
newals). In this typical moral hazard situation (cf. §20), retailers are induced to work 
hard only if companies design a commission scheme that goes beyond the signing of a 
client, for instance they can integrate renewals bonuses. 

If for reasons of cost or competition an insurer decides to give-up a profile of clients 
over a particular product in a particular region, then the ownership of the client list will 
make a difference for retailers. A sales employees who has spent a lot of time to sign 
clients will lose them together with the renewal bonuses; on the other hand, a broker is 
protected against such a situation since he will be able to costlessly switch its clients to 
a new company and maintain its level of satisfaction. 

Symmetrically when a company develops a new product it must agree to share future 
profits with independent brokers in order to have access to their clients. Brokers are 
also able to play one company against the other to extract more surplus from consumers. 
These opportunistic attitudes affect negatively the investments of insurers; there is a 
hold-up (cf. §14.2 on the hold-up). 

A life-insurance policyholder has less tendency to switch company than a the holder 
of a car insurance, also his renewal is much less sensitive to the retailer's effort (there's 
no claim until death occurs); hence the commission scheme can put most weight on the 
signing a new life policy and still ensure optimal effort from retailers. The retailer is now 
much less sensitive to the insurer's ex-post behavior and will therefore easily relinquish 
ownership of the client list. On the other hand, for all products like car, house or fire, 
where renewal is not guaranteed and depends on the current conditions of the policy 
as well as the care taken by the retailer, the latter is more likely to hold the client list. 
Stilzed facts support this view since in the US, 65% of property-casualty insurance is 
generated by brokers against 12% for life insurance. 

Wage bargaining 

Unions and firms periodically bargain over wages and employment. The standard ne- 
gotiation is on "wage rate" only, leaving firms free to choose later on the level of em- 
ployment. Another popular alternative is the "wage contract" negotiation whereby a 
minimum amount of labor-days is also agreed. 34@ Grout (1984) observes that in the UK 
(as opposed to the US), the legal protection bestowed upon unions prevents them from 
committing to any agreement i.e., they can always go on strike to grab an additional ad- 
vantage (this was true in the 1980s). Whatever the agreement initially signed, the firm 
anticipates the future renegotiation towards higher wages. Since technical efficiency in 
production commands to equate the MRTS (cf. eq. 2.1) to the ratio of wages over the cost 
of capital, the firm will under-invest. On the contrary, in the US where agreements are 



binding (enforced by the law), firms are lead to invest optimally. 

The same argument indicates that specific human capital investment by employees 
(e.g., learning to use an in-house software) is likewise inefficiently low since any worker 
(or his union) fears the future expropriation by the management of the additional profits 
his devoted attitude will generate. Generally speaking, building a reputation for sticking 
to its word permits to reduce the hold-up and thereby improves efficiency within the 
relationship. 



Chapter 14 
Vertical Integration 



Our starting point in this chapter is the observation that nearly all the goods and ser- 
vices we consume are the outcome of a complex process whereby raw materials are trans- 
formed into basic goods or components that are finally assembled into salable items. In 
parallel to this industrial vertical chain, we also find a chain of support services such as 
accounting, administration or marketing helping. Vertical integration occurs when two 
firms active at different but contiguous levels of these vertical chains, decide to merge. 
The reverse process is called outsourcing or spin-off. 1@ 

The object of this chapter is to formalize the arguments developed in the previous 
one regarding the forces behind the creation, expansion or contraction of firms. In the 
first section, we present the bilateral monopoly whereby two vertically related firms 
trade. The application of market power by each of them (aka double marginalization) 
is conducive of inefficiency which is a first reason to either integrate or sign restrictive 
agreements. We then turn to the hold-up problem which originates in the specific in- 
vestments that parties to a trade need to undertake in order to make the most of their 
relationship. The last two sections analyze the cost and benefits of vertical integration 
according to the property rights and transaction cost theories. 

14.1 Bilateral Monopoly 

In this section, we shall see that vertical relationships between independent entities may 
generate an harmful "double marginalization". We then study under what conditions it 
is beneficial for the parties to integrate vertically (in order to eliminate this problem). 
We also look at alternatives to integration and their potentially anti-competitive conse- 
quences. 



14.1.1 Complementary Monopolies 

Cournot (1838)'s treatment of the bilateral monopoly is in fact a study of side-by-side 
monopolists as would be Intel (firm A) and Microsoft (firm B) when selling respectively 
chip and operative system, to competitive computer makers. 

Given that each unit of final product incorporates components A and B, the unit cost 
of a buyer depends on the sum of prices and so does his demand (whatever his market 
power in his own market). From the point of view of A, B's price depresses the WTP 
of buyers; it is as if B's price was an added marginal cost for him. Now when firm 
B exercises market power, she sets her price above her own marginal cost, so that A 
perceives a demand further depressed. His own profit maximization will proceed as if 
B's marginal cost was higher than it really is, thus he chooses to reduce sales by naming 
an even greater price. It is now clear that the two monopolies have entered a spiral of 
increasing prices that moves the sales away from the joint profit maximizing solution 
where only true marginal cost would enter calculations. The limit is reached when both 
prices are best reply to each other; this constitutes a Nash equilibrium. 

Let us illustrate this phenomenon analytically. Given that each unit of final product 
incorporates components A and B, the demand received by A and B is identical and a 
function D{pA + Pb) of the total price of the two components. Firm A's profit thus reads 
7i A = p A D{p A + pB)-C A {D{pA + PB))- Since D {p A + p B ) = q can be inverted into p A + p B = P{q), 
we can rewrite profit as n A = {P{q) - p B )q - C A {q) so that the FOC for optimal sales is 
R m = C^ + Pb as if the marginal cost of firm A was inflated by firm B's price. 

If firms A and B were integrated or acted collusively, their joint profit would be n = 
(Pa + ps)D(p A + p B ) - C A (D{p A + p B )) - C B {D(p A + p B )) = qP(q) - C{q) where p = p A + p B and 
C{.) = C A {.) + C B {.). The optimal choice q would solve R m = C m = C^ l + C^. Letting pi = C l m {q) 
for i = A,B, we observe that the optimal price p = P{q) > R m {q) = C m (q) = p A + p B . 

We can now easily show that the sum of uncoordinated prices (side-by-side monopo- 
lists) overshoots the joint profit maximizer. If firm B was to price at ps then firm A's 
optimal sales would be q and she would achieve that objective by setting the price 
p A = P{q) - Pb> Pa- In response to that choice, firm B would solve R m = C^ + p A which 
would entails a quantity q < q and therefore a price p B = P{q)- p A = P{q) -P{q) + pB> Pb- 
This proves that the pair (p A , p B ) is not an equilibrium of the indirect interaction among 
A and B and that a stable situation involves greater prices for both firms. The Nash 
equilibrium is the solution of the system in 3 unknowns R m {q) = C^iq) + Pb = C^(q) + p A , 

p A + PB = P(q)- 

Consider for example, constant marginal cost C[ for i = A,B and linear demand D{p) = 
a - /3p; the efficient quantity is q* = a - /3(ca + eg), the joint monopoly one is q = \q* and 
it is easy to check that the Nash equilibrium is pa - a+ 2 ^~ CB \ Pb - 'g?"^ , leading 



to sales q = \q* ■ Uncoordinated firms further restrict trade as compared to the optimal 
choice of a joint monopoly. 

14.1.2 Vertical Monopolies 

The situation most interesting for modern economics is the face-to-face bilateral mono- 
poly illustrated below where an upstream potential maker of an item, the seller S ex- 
changes it with a downstream potential user, the buyer B. An example would be Sam- 
sung negotiating with Apple the making of flash memory hard drives for the iPod or a 
manufacturer like Nestle negotiating with a retailer like Carrefour: 

Manufacturer Retailer 

Raw — ► Intermediate — ► Final 

input Upstream good Downstream good 

Pareto Optimality 

Let us start with some definitions. If a quantity q is produced and exchanged, the seller 
incurs production cost C{q) while the buyer derives a net revenue R{q) (treating the item 
as a free input). Given an agreed total price F (or unit price Flq), profits are ns = F-C(q) 
for the seller and %b = R(q) - F for the buyer. The joint profit is thus n{q) = tis + tib = 
R{q) - C{q). This expression is maximized by the choice of q solving R m = C m and yields a 
maximum profit n = n{q). 

As we showed in §2.4.3 on bargaining, the parties will insist on trading q since any 
other trade q, whatever the unit price it involves, can be improved by this particular 
one to the benefit of both parties. There is indeed an additional value ti-tis-tib that is 
created when moving form q to q;it can be shared among the two parties, making both of 
them willing to accept the quantity change. If the renegotiation of the initial contract is 
costless, then this is necessarily the outcome (cf Coase theorem §8.1.3). What remains 
unclear is the average or unit price, since the cake can be shared in any way among the 
parties. 2@ Incorrect solutions to the bilateral monopoly problem will be presented at the 
end of the section. 

Incomplete Contracts 

Modern contract theory offers a compelling explanation of why the competitive price 
p = R m {q) = C m (q) is most likely to be observed. Consider an initial agreement between 
S and B to deliver q units at unit price p. The seller can at very low cost renege on his 
scheduled deliveries by blaming the delay on unexpected problems; he can nevertheless 



insist on being paid for the delivered units. Likewise, the buyer can renege on the sched- 
uled quantity by arguing that the undesirable units are unsatisfactory; he thus only 
pays those he accepted. What this trivial analysis of contract law suggests is that only 
the actually exchanged units have to be paid since it is easy to demonstrate to a judge 
that they were delivered and accepted (both signatures appear on the delivery receipt). 
Forcing both parties to perform with respect to the scheduled quantity is much harder 
because so many excuses are acceptable at first hand; it would take time and expertise 
to identify those which are acceptable and those which result from malevolence. 

The quantity optimally delivered by the seller is thus the minimum of the agreed 
one and the competitive one at the specified average price. As illustrated on Figure 
14.1, given an agreement over quantity q , the seller fulfills only if the average price is 
greater than p 2 , otherwise he delivers less than scheduled. The buyer, on the other hand, 
fulfills only if the average price is lesser than p\. Thus, only prices between p 2 and p\ are 
conducive of fulfillment of promises by both parties. As the quantity is increased towards 
the joint profit maximizing level q, the range of acceptable prices shrinks towards a 
unique level, the competitive price p. 



AP 




Figure 14.1: Sharing driven by contractual incompleteness 



Pure bargaining 



In many situations, it is possible to write a binding contract covering not only quantity 
but also price. Bargaining then takes place ex-ante at the design stage to determine how 
the value created by an optimal trade is shared among the parties. Let us assume that 



if negotiation fails, each party can secure a default payoff m for i = S,B. The quasi-rent 
to be shared by coordinating the sale is 6 = n-jiB-n s and this is where bargaining is 
involved. 

The bargaining theories recalled in §2.4.3 converge in the case of certainty towards 
equal sharing of the quasi-rent. To conclude, the jointly optimal quantity (yet often 
inefficient from the social point of view) will always be traded. 

14.1.3 Double Marginalization 

The Issue 

Many economists dissented with the above bilateral monopoly analysis; they adhered to 
a different and sub-optimal solution akin to Stackelberg (1934) leadership: one party, 
often the seller, names a unit price p while the other party, often the buyer, names 
a quantity q, trade then proceeds. 3@ If the buyer applies his market power upon the 
downstream segment of the market, he maximizes jib = R{q) - pq which leads him to 
demand the quantity solving R m {q) = p. The marginal revenue generated by the item for 
trade thus appears to be also the willingness to pay of the buyer. Now, the seller who is 
a first mover, anticipates this demand in order to maximize her own profit; she applies 
market power over the buyer's WTP. The seller profit being ti$ = qR m {q) -C{q), the FOC 
is C m = R m + q^§f < R m since marginal revenue is decreasing; this inequality means that 
the the seller selects a quantity lesser than the optimal one q. i@ 

A double marginalization is at work here because the buyer's demand is more inelas- 
tic than the market demand at the origin of the revenue R{q). Another way to see the 
sub-optimality at play is to reason that only if the seller forgoes his market power and 
sets price competitively (i.e., p = C m ) will the buyer demand the right quantity. The over- 
all sum of profit thus falls short of its maximum n when monopolies limit themselves to 
such simple allocation rules ("you name a price, I name a quantity"). It should therefore 
be clear that unless some external conditions impeach the parties to either integrate or 
design binding contracts covering both quantity and price, no sane businessmen would 
ever continue to trade under such sub-optimal rules. An extremely simple resolution is 
to allow the first mover to price discriminate the other party by making for instance a 
price listing or a bundle offer (cf §4.1.3 on consumer surplus extraction). 

Remedies 

Double marginalization does not show off if the downstream market is competitive since 
only the upstream firm has the ability to apply market power. Even in that case, the 



seller might fall short of maximum profits if the production of the final good uses several 
inputs that are, to some extent, substitutable. Consider the example of Tetrapak a lead- 
ing maker of carton packages for beverages. When Tetrapak, the upstream firm, raises 
its price to a monopoly level, the downstream retailers like the dairy producer Danone 
start to substitute the expensive carton for the now relatively cheaper plastic or glass 
bottles; they optimally adjust their mix of inputs so as to keep their technical rate of 
substitution equal to the price ratio. As a consequence, the downstream production uses 
an inefficient combination of inputs and the demand for the upstream monopolist's input 
falls (Tetrapak sells less carton). The ability to recoup full monopoly profits is lost for the 
presence of a substitute (plastic) to the upstream input (carton). The only way to restore 
profits is to integrate downward to control the choices of inputs or use the alternative 
contract presented hereafter. 

The solution uses price discrimination (cf. §4.1.3) with a two-part tariff where the 
unit price is set equal to the marginal cost; this option incentivizes the retailer to sell the 
monopoly quantity. It remains then to negotiate the franchise fee so as to share the joint 
monopoly profit among the upstream and downstream firms (cf. §2.4.3 on bargaining ). 
If there are several downstream firms, the subscription will need to be individualized (cf. 
§4.2.1 on perfect discrimination). The problem is that the fee usually paid on an annual 
base tends to be very large putting the retailer at risk of bankruptcy especially when 
demand can be fluctuating. 

To conclude, we might say that even if there is market power at all levels, there are 
many ways for the upstream firm to restore optimality (NOT efficiency) i.e., maximize 
the sum of profits for the firms involved in the production of the final good. 

Investment Incentives f 

We present here a preview of the general result of the next section on relation specific in- 
vestments and the hold-up problem, namely that firms invest less into their relationship 
than if they were under common ownership because they anticipate that future gains 
from trade will be shared according to a flexible rule, sensitive to their behavior. 

Imagine that revenue and cost can be enhanced by specialized investments i.e., ex- 
penses dedicated at increasing revenue or reducing cost but exclusively for the exchange 
of the item of interest for the two parties. Let kg and ks denote the amount spend by 
B and S respectively into that activity. We also denote C q = |^ and Q = |p and assume 
C qq < i.e., DRS and C q k < i.e., investment improves the marginal cost. The maximum 
joint profit being n{q) = R{q) - C{q), the marginal value of investment into the production 
technology for joint profits is j£- = ■§£- + -^- 1| = -Q by definition of q which continues to 
satisfy R q = C q . 



We have previously argued that the only stable exchange price among two indepen- 
dent firms is p = C q {q) = R q {q). Profits are thus ns = qp- C{q) and n B = R{q) - qp. Differ- 
entiating the FOC of optimality of q with respect to ks, we obtain -£- = p f r > which 
will be used hereafter. The marginal value of investment into the production technology 
for the seller is -^ made of three terms. The first and direct effect is -^ = -Q as in the 
joint profit case. The second term is the indirect effect through the price ^-g§- while the 
third term is the indirect effect through the quantity -j^j- -=$- . If we are able to show that 
the sum of indirect effects is negative then the seller puts less value on investment than 
a joint owner. Observe first that -^ - q and -j^-^-^ + p-Cq-q^ since p = C q {q) by 
definition of the agreed trade. Then, the indirect effect is 

dp dp dq I Cqq \ RqqC qk 

dk s dqdk s \ R qq -Cqq) Rqq-C qq 

as soon as revenue displays DRS, a feature of most markets. 

A symmetrical inequality obtains when comparing the private marginal value of in- 
vestment into the revenue technology and the jointly optimal value. This result is a for- 
malization of Williamson (1971)'s under-investment into assets that are relation specific 
when parties anticipate that future gains from trade (n) will have to be shared according 
to a flexible rule. 

Indeed, if it was possible to specify a fixed total payment F in return for delivering the 
optimal quantity then each party would face the ideal incentives to invest. 5@ The reason 
why F is difficult to choose is because uncertainty can provoke wide variations into R or 
C that could leave one party bankrupt if the payment F had to be fulfilled. It is therefore 
quite likely that F will be renegotiated, but if this is so, parties will anticipate this fate 
and modify accordingly their investment choices. 

14.1.4 Resale Price Maintenance 

We illustrate here how a resale price maintenance (RPM), seen in §9.2.2 on antitrust, can 
be an alternative to integration in order to overcome the double marginalization problem 
(cf. §14.1.3). Imagine that a manufacturer sells a product to competing retailers using 
a two-part tariff (p,F). There are neither production nor distribution costs (r = c = 0). 
Each retailer faces a market demand D{p) = 1 - p divided in two segments, one of size 
a is protected from the competition of other retailers, for instance because the involved 
customers have a high opportunity cost of searching for lower prices. The remaining 
segment of size 1 - a is open to price competition among retailers. The dichotomy could 
alternatively be between households and firms since the latter are more able to dedicate 



resources to finding a better deal. 

The industry maximizing behavior is to set the monopoly price p = \ in order to earn 
n M = \ on each retail market (and \ on average). Let us denote w the wholesale price 



set by the manufacturer. In absence of a RPM, the equilibrium retail prices are ^^ over 
the monopolized segment and w over the competitive one, leading to individual retailer 
profits of a a ~ 4 w) which the manufacturer absorbs using a franchise F. Since the sales 
to each retailer are ^^- and 1 - w over the two respective segments, the manufacturer's 
profit for each market is 

n{w) = a [± ^- + w [a 1 ^- + (1 - a)(l - wj) = (1 - w) ((4 - 3a) w + a) 

and is maximum for w a = 2^£^ > 0, so that the final prices in the competitive and mo- 
nopolized segments are ranked as follows: w a < \ < -^p 1 . The absence of RPM and the 
oligopoly structure in the downstream market therefore leads retailers to adopt a price 
discrimination that is wasteful for industry profits; although the manufacturer recap- 
tures all profits through franchise fees, he earns only n{w a ) = \ { llf a , a convex function 
that reaches n M at a = and 1 only. 

To restore full monopoly profits when there is heterogeneity of consumers (0 < a < 1), 
any of the three RPM will do the job: 



• set F = 0, w = \ and fix w as a retail price. 

• set F = 0, w = \ and set w as a maximum price. 

• set F = |, w = and set a minimum price of |. 

The first strategy is tantamount to integration while the last one is the usual expla- 
nation for minimum prices: the wholesale price is set at the marginal cost (here 0) to 
avoid double marginalization and a minimum price guarantees that retailers will not 
loose their margin in price wars. The second strategy is a little known rationale for max- 
imum retail prices: the wholesale price is set at the monopoly level corresponding to the 
competitive segment while the maximum price prevents retailers from overexploiting 
captive customers (double marginalization again). 

In other words, if there are n classes of customers upon which price discrimination 
can be applied, the usual double marginalization problem is multiplied by n, so that the 
manufacturer needs n - 1 new instruments to guide the pricing behavior of retailers to- 
wards industry maximum and rip the whole profit using the franchise fee (with adequate 
sliding). If, like in many markets, there are roughly two segments of captive and com- 
petitive (bargain seeker) customers, then a single RPM in each class does the job. The 
econometric study of Shepard (1993) on gasoline price at stations integrated or not with 
refiners confirms the theoretical findings of the model. 



Observe that in this example, the average price without RPM is a^p 1 + (1- a)w a = 
\ 4 ~4-la a) > h ^ ne avera g e price with RPM; thus RPM is welfare improving since for ho- 
mogeneous goods, total quantity is a faithful indicator of market welfare. This need not 
be true in more general models. Still, the case against RPM cannot be concluded in a 
clearcut manner from the state of actual research. 

14.2 Specific Investment (aka Hold-up) 

In §13.3.3, we commented upon the rational opportunistic behavior of a party to a bilat- 
eral trade who tries, by haggling, to grab the greatest possible share of the value to be 
generated by the exchange. When a party holds-up the fruit of the other party's effort 
to improve the relationship, the latter rationally refuses to invest efficiently; integra- 
tion (vertical or horizontal) might then be a solution to force a cooperation at the ex-post 
phase thanks to the imposition of agreement by fiat (authority). 

In this section, we present formally this theory and the contractual solutions that 
have been proposed as well as their limitations for two families of problems: trading 
a variable volume or a unique asset. We assume ex-post efficiency i.e., parties have 
the necessary information to identify an efficient course of action once all the relevant 
information has been revealed. In other words, they are able to bargain quickly over the 
rent to be shared without haggling or entering into wasteful rent-seeking behavior. The 
property rights theory (PRT) taken up in the next section, carries on with this hypothesis 
and develops models of the limit to the boundaries of the firm based on the resolution 
of the hold-up. The transaction cost approach (TCE), to be studied in the last section, 
emphasizes on the contrary the frequent inefficiency of ex-post haggling over quasi-rents 
and makes it the main reason for integration. 

14.2.1 Ex-post Opportunism 

Specific Trade 

When a firm buys a quantity q of input on the world market at price p, she has efficient 
incentives to invest into her technology. Letting A; denote the monetary investment, her 
profit is 7iB = R{k,q) - pq-k. Even if the market price is random, the FOC of optimal 
investment k* is ^f = 1 which is efficient. This is so because the unit price, however 
shaky it may be, is completely insensitive to the investment effort procured by the firm 
(recall it is world price determined by the aggregation of literally hundreds of bidders). 
As we shall see, bilateral trading changes this. 



Consider now the case where the specific needs of the firm B force her to buy from 
a dedicated seller S whose cost is C{q). They negotiate a quantity and a price. The 
potential value of trade is R{q) - C{q) and will optimally be maximized by the parties if 
there is symmetric information i.e., each knows the valuation (revenue or cost) of the 
other party. The efficient (ex-post) quantity is q solving R m = C m ; it leads to a surplus or 
quasi-rent 6 = R{q) - C{q). If the negotiation fails, each party continues with his activity 
and earns a default profit assumed nil for ease of exposition. 6@ Notice that uncertainty 
could be present and affect the ex-post valuations R and C. 

It remains to negotiate the exchange price and this is where the hold-up arises. We 
assume w.l.o.g. that the surplus 6 is equally shared 7 ® i.e., p = { ' q l^ {q \ The buyer's 
profit is now n B = \{R(k,q) - C{q)) - k, so that investment incentives have been halved 
(the marginal gain of an additional investment is only 50% of the wealth so created) A 
completely symmetric argument holds for the input seller. 

The hold-up refers to the fact that the inescapable negotiation over the price charac- 
teristic of a bilateral relationship forces the buyer to share the fruits of his investment k 
with the seller, he thus become a partial claimant whereas efficiency requires him to be 
a residual claimant. Whenever some aspect of the investment can be contracted upon, it 
is in the interest of the parties to do so and specify that the efficient level k* should be 
carried on. If this is not possible (or too costly), then integration might solve the invest- 
ment problem since the common owner will internalize the externality that the seller's 
opportunism imposes on the buyer. 

In §14.1.3, we obtained the same qualitative result when the "competitive" price was 
the equilibrium of the bargaining; this occurred due to the impossibility of signing a 
binding sales contract. 

Commitment 

We now show the fundamental role of commitment in the appearance of the hold-up i.e., 
the incentives towards investment differ according to whether the trade is negotiated 
after or before the investment is sunk. Let the maximum ex-post surplus be n{k). If the 
trade is agreed before the investment, the buyer's default payoff (in case of disagreement) 
is zero while if the trade is agreed after the investment, the buyer's default payoff is the 
resale value pk of the capital invested. We assume that p < 1 because the investment 
was specially geared towards the trade with the seller, thus has less value in alternative 
uses. The two settings also differ with respect to where the investment cost is imputed 
into the buyer's payoff; if trade is agreed ex-ante then the investment cost is integral part 
of the negotiation while it is not if trade is agreed later on. We can now easily derive the 
buyer's ex-ante payoff with rational anticipation in the two settings and compare them. 
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Table 14.1: Hold-up and Commitment 

When the agreement to trade is binding, the buyer is a residual claimant of his invest- 
ment although he nets only one half of the absolute surplus generated by the exchange. 
When the agreement to trade is not binding (i.e., renegotiated at will), the buyer suffers 
a hold-up due to the opportunistic behavior of the seller; his optimal investment is thus 
inefficiently low (cf. §13.3.4). 



Contractual Instruments 

Chung (1991) shows that a simple contract can resolve the mutual hold-up if two con- 
ditions apply: one party can be given full bargaining power ex-post and it is possible to 
specify a default trade [q, p) or at least an outside option to the other party. Regarding 
the first condition, we argued in §14.1.2 that parties will try to act opportunistically at 
the delivery-reception stage so that enforcing a specific quantity performance is diffi- 
cult. To make a party dominant ex-post, it is possible to write in the initial contract that 
this party, say the buyer, will lead ex-post negotiation and make a "take-it-or-leave-it" 
trading offer. The seller then is forced to accept or walk away from the relationship. 8 ® 

The general formulation of the hold-up sees both parties invest into specific value 
enhancing assets so that revenue is R{ks,q) while cost is C[ks,q). The efficient ex-post 
quantity q continues to solve the same FOC but is now contingent on past investments. 
The trading surplus is n = R[ks, q) - C{k$, q) while the default payoffs are ub = R{kB,q)- qp 
and us = qp- C{ks, q). The quasi-rent to be divided during the ex-post negotiation is thus 
8 = ji— ub~ us- 

Having all power, the buyer's offer grabs the entire quasi-rent so that his profit is 
n B = Ub + 8 - ks = 7i - k B - us- By construction, the buyer has the right incentive to invest. 
Since the seller has no bargaining power, his ex-post gain is us and ex-ante, his profit is 
TiB - qp - C{ks, q) - ks- Given the efficient seller investment A;^ solving d £' q = 1 (where, 
recall, q is a function of kg), it is possible to tune the constant q so that { dk s ' q] = 1 (thanks 
to the intermediate value theorem) in order to induce the seller to make the efficient 
investment. 



14.2.2 Buy-Out 

We leave aside bilateral trading and concentrate on the exchange of a single asset such 
as an infrastructure, a software, a work of art. The asset requires specific investments by 
both sides to capture all its potential value. If the agent's investment could be contracted 
upon, there would be no hold-up since the principal, as the asset final owner, would 
be the residual claimant of the value added by her own investment. She would thus 
instruct the agent to invest efficiently and would have the natural incentive to invest 
efficiently herself. However, many dimensions of the agent's duty, although observable, 
are non contractible. This means that investment incentives must be provided for him 
to act diligently. To alleviate the hold-up problem, it is possible to supplement the initial 
contract with a clause whereby the principal can force the agent to "buy-out" the asset in 
order to discipline him. Alternatively, the contract first sells the asset to the agent and 
specifies a repurchasing price for the principal and a later date at which he may exercise 
this option or "opt-out". 

Demski and Sappington (1991) show that if the parties can stick to their initial agree- 
ment i.e., commit not to renegotiate, then efficient investments obtain for both parties. 
Edlin and Hermalin (2000) however qualify this result because the proposed contract is 
not robust to renegotiation i.e., the principal can hold-up the agent by threatening to 
opt-out. Efficiency obtains only if investments are substitutes at the margin. 

An animated example 

Edlin and Hermalin (2000) illustrate the issue at stake with the Pixar-Disney collab- 
oration for the movie Toy Story. Each party brought unique talents: Pixar had the 3- 
D animation technology and Disney had the distribution and marketing expertise for 
animated films. Prior to release, Disney could observe the quality of the film Pixar pro- 
duced, but it is difficult, if not impossible, to describe quality in a contract or demonstrate 
it in court unambiguously. Giving Pixar appropriate incentives to make a great movie 
would, then, seem problematic. Disney could commit to buy the movie at a fair price but 
then Pixar, acting opportunistically, had no incentive to work hard which constitutes the 
archetypical case of moral hazard. If instead, the price is negotiated after the making of 
the movie then Pixar is victim of the hold-up problem for Disney will opportunistically 
(but rationally) bargain over the price and capture some of the value that Pixar created; 
Pixar anticipating this fate will not work so hard to make the perfect movie. 

A good compromise is to fix a price initially and give Disney the option to cancel the 
deal. Buying the film at that price will only be attractive to Disney if Pixar makes a 
sufficiently good movie. The optimal price should equate the film's final value to Disney 



assuming Pixar exerts optimal effort. Absent renegotiation, this contract is efficient, but 
Disney has an incentive to let its option expire and subsequently renegotiate a lower 
price for the film threatening to leave Pixar alone to market the movie. Even if the 
parties make the option non-expiring, renegotiation is still a relevant threat: Disney has 
an incentive to delay promotion and distribution until it can renegotiate a better deal. 

Hence, the option contract provides little protection from the hold-up problem. Yet, 
there is a second and dynamic dimension to this story: Pixar's effort (making a movie) 
strengthens his bargaining position, since if bargaining breaks down Pixar ends up own- 
ing a great movie and valuable know-how that could interest Disney's competitors like 
Time-Warner, Dreamworks or Universal. 

Facts confirm this theory. In 1991, Disney and Pixar signed an agreement for the 
development and production of up to 3 movies whereby Disney kept 85% of all profits 
but financed production, distribution and marketing costs. After the success of Toy Story 
in 1995, Disney and Pixar signed in 1997, a new agreement replacing the former, for 5 
movies whereby production costs and profits were equally shared. Later on, after the 
tremendous success of Finding Nemo, Pixar decided it would not renew its association 
with Disney after delivering the fifth agreed movie. Finally, after several more block- 
busters, Pixar had acquired so much goodwill that Disney had to buy it in a remake of 
the General Motors vs. Fisher Autobody story analyzed by Coase (1937). 

Solution with Commitment f 

The following simple agency model partly formalizes the Pixar-Disney story by introduc- 
ing a one-sided hold-up problem and showing how it can be solved if firms can stick to 
their initial agreement i.e., commit not to renegotiate. In the next section we shall see 
under what conditions, the unavoidability of renegotiation breaks this positive result. 

A principal (she), who owns a transferable asset like a patent, a store, or a movie 
idea, hires an agent (he) to realize a project by applying labor L > upon the asset. 
Later on, the principal invest K > 0, for instance in advertising; both decisions are non 
contractible 9@ and are therefore chosen according to rationality i.e., each party takes 
the action maximizing its objective at that moment. The expected profit generated by 
the asset is R{K,L) with Rl>0 and Rk>0. We denote IT(L) = max K R{K,L) -K, the maximal 
value of the project given the amount of effort L previously applied; it satisfies n^ = Ri by 
the envelope theorem. Lastly, the reservation utility of the agent is set w.l.o.g. to zero so 
that the efficient effort (first-best) is L* maximizing Yl{L) -L i.e., solving R L = 1. 

The optimal contract maximizes the principal's expected profit subject to participa- 
tion of the agent. Consider an option contract ipi,p2) offered by the principal to the 
agent. Upon acceptance of the contract, the principal transfers ownership of the asset 



to the agent for a price p\. After the agent has applied his effort, the principal has the 
option to buy back the improved asset at price p 2 . If she declines to exercise her option, 
the agent retains ownership. After deciding whether to exercise her option or not, the 
principal chooses her investment K. The final owner of the asset receives the return 
R(K,L). 

Demski and Sappington (1991) propose p* =U{L*)-e-L* and p* =Yl{L*-e) where e>0 
is small. We analyze the game tree by means of backward induction to check whether 
this option contract implements the optimal actions. Assume the agent has accepted 
(p*,pp, exerted some effort L and that the principal has exercised her buy-back option. 
Conditional on L, she will optimally invest and earn Yl{L). Thus, she should exercise 
the option only if U{L) > p* o L > L* - e. Knowing this, the agent can choose L> L* -e 
to earn p* 2 -p\-L which is maximum for L = L* - e; this way he nets p* 2 - p\ - L* + e = e 
(by Taylor's formula using the optimality of L* ). If, on the contrary, the agent chooses 
L < L* -e, then the principal does not exercise and invests K = so that he will earn 
R(0, L) - L - p{ = R(0, L) - L - R(K* , L*) + e < e - L* < by the optimality of L* . 

The contract (p*,p|) was obviously designed to make the first choice dominant; this 
way the agent earns a net utility of e > so that accepting the contract is initially an 
optimal choice (dominant strategy). Letting e tend to zero the principal can obtain the 
first-best surplus U{L*) -L* if the agent is rich enough to pay that amount at the initial 
stage. 

This wealth requirement is problematic in an incomplete financial market and makes 
the proposed solution quite infeasible. Indeed, the agent is typically not very rich (think 
of the young company Pixar back in 1990) while the up-front payment can be very large 
(think of the profits that a Disney blockbuster can generate), thus the agent must raise 
the money from the financial markets. Now bankers who are not animated movie special- 
ists have a difficulty to identify whether the whole idea will be profitable; they therefore 
tend to underestimate the expected return of the asset and either refuse to fund or ask 
a large risk premium to lend the agent the required amount Yl{L*)-L* . 

Inefficiency of Renegotiation f 

Committing not to renegotiate is problematic as it goes against one of the Pareto prin- 
ciple: rational agents understand the benefits of renegotiating an agreement leading to 
Pareto dominated payoffs. If there is perfect information and negotiation is costless then 
the final agreement is bound to be Pareto efficient. Edlin and Hermalin (2000) show that 
this possibility hinders the previous analysis; if renegotiation cannot be prohibited, then 
the previous option contract must be amended and can still implement the first best if 
and only if investments are substitutes at the margin. 



If the principal, upon having sign the contract (p\,p2), refuses to buy back the asset, 
the agent is left with a profit of R{0,L) -pi-L because the principal will not invest at 
all. By definition of U(L) we have U[L) > R{0,L) thus the asset value is R{0,L) -p\< U{L) 
for any positive price p\ which means that there is always scope for a renegotiation of 
P2 (whatever its original value) in order to reach the better outcome where the principal 
buys back the asset and invests efficiently in it, in order to increase the asset value to 
n(L). To simplify the analysis, we assume equal bargaining power. 

The amount over which they bargain is 5 = U{L) - R{0,L) + p\ thus the renegotiated 
price is p' 2 = P{L,p\) = R{0,L) - p\ + \8 < U{L) (we are in the case of renegotiation thus 
R{0,L) -p\< n(L)). We now see clearly that if the principal offers the previously optimal 
contract (p*,/^), the agent will refuse it because he anticipates that the future exercise 
price will be P{L* ,p* x ) < p 2 = U{L*) which means he won't cover his own investment L* . 

This problem can be overcome when efforts are substitutable, that is when the agent's 
effort improves greatly the value of the asset without the help of the principal; an 
example might be the Pixar case since Disney had no role in the making of the movie. 
If it is indeed true that Rkl\{k*,l*) < °> we can se t Pi = p{, Pi = P(L* ,p\) < II(L*). Observe 
that P L = Rl(0, 3 )+ni > n L > because U L = R L (K{L),L) < R L (0,L) by the envelope theorem and 
the substitutability of efforts. 

Suppose now that the agent expands the effort L > L* ; if the principal decides to 
renegotiate the option price p 2 , she ends up paying P{L, p*) > P{L*, p*) = p 2 because Pi > 0. 
Thus, it is better to exercise the option at p 2 without renegotiating. Symmetrically, it is 
beneficial to enter the renegotiation process if L < L* . Over the domain L> L* , the payoff 
of the agent is the decreasing function P{L* ,p\) -L; thus the optimal effort lies below L*. 
Over the domain L<L*, the payoff of the agent is U[L) = P{L, p\) - L with U L > U L - 1 > 
because Ul{L*) = 1. Overall, L* is the optimal choice (a corner solution though) and since 
the initial price is p\ , the previous proof applies. 

If efforts are complementary, then no option contract can implement the first best 
effort from the agent because the renegotiated price P increases too slowly with effort 
(p L < i => Ui < 0). Unfortunately this is the most relevant case since the very reason why 
the principal and the agent decided to form a team was to internalize possible synergies 
i.e., because they expected their efforts to be complementary. 

An example of complementarity vs. substitutability is as follows. Suppose that a US 
teen goes to the movies either if a friend recommends it or if he is impressed by the mar- 
keting or TV advertising. Let z{L) be the probability that a friend does not recommend it 
(zl < 0) and let y{K) be the probability that he is not impressed by the marketing ()>k < 0). 
Assuming independence, the probability he goes to the movie is 1 - z{L)y{K), which has a 
negative cross partial derivative, implying that efforts K and L are substitutes and that 



the first best is implemented by an option contract. On the other hand, a European teen 
being more solicitous will go to see the movie only if he receives two good signals so that 
the probability is {l-z[L))[l-y{K)). Noticing that this expression has a positive cross 
partial derivative, we deduce the complementary of efforts. 

The general conclusion is that the consumer market plays an important role for the 
optimal allocation of incentives in production: never forget that the customer is king! 

14.3 On Property Rights 

In this section, we emphasize the role of property rights over assets as an instrument 
guiding the incentives of the parties to perform relationship-specific investments. 10 ® 
Whereas TCE focuses on the level of quasi-rents as the main reason for integration, 
PRT focuses on the investments that can increase the quasi-rents. 

14.3.1 The Control of Incentives 

According to the synthesis offered by Hart (1995), parties engaging in a transaction 
write contracts that are ex-ante incomplete, but complete them ex-post once uncertainty 
resolves. The ability to exercise residual control rights improves the ex-post bargaining 
position of an asset owner and thereby increases her investment incentive as well as the 
incentives of those who enjoy gains from trade with her. As a consequence, it is optimal 
to assign asset ownership to those who have the most important relationship-specific 
investments, or who have indispensable human capital. 

Private Benefits 

To illustrate how firm boundaries affect decision making, consider two production units 
or divisions A and B. Each unit i = {A,B} generates monetary profits vt and private 
(nontransferable) benefits Wi in the form of job satisfaction for those working in the unit. 
If the units are independent, manager i being the boss of unit i, maximizes vi + u>i since 
she owns the profit from unit i and cares about her own private benefits. 11 ® In contrast, 
if units A and B are integrated, then what happens depends on who is the overall boss. 
If A is the overall boss, she maximizes v A + v B + wa, since she owns the profit from both 
units, and cares about her own private benefit but not B's. A similar formula holds if B is 
the overall boss. Finally, if a (professional) outsider is brought in to be the boss, she will 
maximize v A + v B , since she owns all profits and does not care about the private benefits 
of A and B. 



Given that the social optimum is achieved by maximizing total surplus Va + vb + u>a + 
wb, we can conclude that under independence, bosses maximize the right thing, profits 
plus private benefits, but are parochial since they fail to take into account their exter- 
nality on the other unit, while under integration they maximize the wrong thing, their 
sole profit but have a global scope. This demonstrates that both integration and inde- 
pendence have merits and defects. 

Diversion of Benefits 

In §13.3.4, we argued that the owner of an asset should be the economic agent able 
to use in the more efficiently. A simple example will help to understand this claim. 
Consider a machine like the printing press of our original example that necessitates 
only managerial effort e, measured by its opportunity cost and generates a deterministic 
profit n{e) (return) without creating any externality. In the presence of moral hazard 
i.e., when effort cannot be controlled, the agency theory shows that the manager's wage 
should be tied to profits in order to give him efficient incentives for effort; yet this does 
not makes him the owner of the asset. 12@ In the present case, the efficient effort e* solves 
n'{e) = 1 (marginal benefit equals marginal cost). The manager will optimally expand 
e* only if his wage is w{e) = n{e) -F i.e., if he owns the return (which must be clearly 
identifiable). Yet, there is no need for him to own the machine i.e., have the right to 
decide on the future use of the machine. 

In more realistic situations there is still a difference between ownership of asset and 
return. Indeed, the manager's effort hardly generate cash per-se; instead it generates 
an input for another production process. In our example, the printer makes books that 
must be sold later on by the publisher. The latter is thus able to divert some benefits of 
the printing activity, say a share A, in a way that is non contractible. For instance, he 
uses the marketing activities organized around the publication of the book to promote 
other works by the author (under contract with him). Hence at most 1 - A of the return 
can be contracted upon between the printer and the publisher. If the printer's wage is 
w{e) = (1 - A)n(e) - F then his effort will be lower than e* because the publisher "holds-up" 
a share A of the profits created by his efforts. The only way to restore efficiency is for the 
printer to buy the publisher in order to reclaim the missing A% of the benefits. 

14.3.2 Relation Specific Investments 

The basic idea 

Grossman and Hart (1986)'s model as recast by Gibbons (2005), considers investments 



that are observable by the parties but not verifiable by a third party 13 ® Let i = N,B,S 
denote "Non integration", "Buyer integration" and "Seller integration". After the parties 
have decided over their ownership structure, they invest in assets that are specific to 
their future trade. Later on, uncertainty resolves and from this moment on, the relevant 
characteristics for the trade become contractible (complexity has been reduced). Let us 
analyze the outcome if there is no renegotiation of the initial contract. If one party 
has integrated the other, she controls all relevant dimensions of trade and thus chooses 
the characteristics maximizing her sole profit. Under independence, each party retains 
control over some relevant dimensions and enters a rent-seeking process where she tunes 
the characteristics under her control to maximize her profit taking the choice of the other 
party as fixed; there is a Nash equilibrium. We thus have in each case a default trade q l 
that depends on the ex-ante investments and the current conditions. 

However, when the trading period arrives, both parties reason that their previous 
investment together with the ex-post trading conditions make a particular trade q* ef- 
ficient i.e., it maximizes W{q) = n B + ns and is different from all the default trades men- 
tioned above. The ex-post quasi-rent is thus 5 = W{q*) - W{q l ) and being shared equally 
among parties, the ex-ante profit of the buyer is thus 

U B = n B {q l ) + 2 5 = 2 f 7 * 5 ^ + n s{q*)) + cte (14.1) 

with a symmetric formula for the seller. The buyer's incentives to invest ex-ante are 50% 
of the marginal return at the efficient trade q* plus 50% of the marginal return at the 
inefficient trade q l . Whatever the ownership structure, it is never the case that a party 
to this relationship has first-best investment incentives. Grossman and Hart (1986) then 
observe that if the ex-post objective of each party mostly depend on the characteristics 
she controls then q N is nearby q* so that non integration is almost efficient which makes 
it the dominant choice. If on the other hand, a party's objective hardly depends on the 
characteristics she initially controls, then integration under the control of the other party 
is almost efficient which makes this ownership structure the dominant one. 

A refined model 

Whinston (2003) develops more formally the argument. Consider a bilateral trade set- 
ting involving a buyer B and a seller S. Ex-ante, some specific contractible investments 
are negotiated between B and S to enhance the value of their relationship but some el- 
ements kb and k s are non contractible (e.g., human capital) and are therefore chosen 
independently at a cost c{k) = \k 2 for each party. At that time, the parties can choose to 
integrate (0 i.e., the buyer buys the seller's assets 14 ® or not (n) i.e., remain independent. 



Ex-post, the relevant information about the quantity and quality of the good to be traded 
is revealed to the parties. We make an assumption that greatly simplifies notation by 
dropping reference to the ex-post efficient quantity or quality and only keep the ex-ante 
investments. The buyer's ex-post payoff from trading under design 7 = i,n is denoted 
% b = a b + P b k b + y b k s where the /3 parameter indicates the own effect of one's investments 
while 7 indicates the cross effect. A symmetric formula holds for the seller. 

The key argument to all incomplete contracting theories is that, whatever the ex- 
ante contractual design, be it integration (z) or independence (n), it becomes inadequate 
once all useful information has been revealed, thus an ex-post renegotiation can take 
advantage of unexpected trade opportunities or adjust some dimensions of the trade to 
take into consideration unexpected developments be they positive or negative. Unlike 
TCE which emphasizes the cost of such a renegotiation, the PRT assumes away trans- 
action cost. Ex-post, parties maximize their joint ex-post payoff whose value is denoted 
n{k b ,k s ) = a + iu. b kjt, + [i s k s . Since the joint value of the trade without investments a is 
greater once ex-post adjustments are made we have a > a b + a s . Likewise, the value 
of each investment is greater if ex-post adjustments are made thus the slopes satisfy 
\i b > fi b + jl and ij. s > p J s + y b for design j = i, n. The first-best characterized by contractible 
investments would maximize the net surplus W = n{kb,k s ) - \k\- \k\\ efficient invest- 
ments are then [i b and [i s . Since the point here is that those human capital investment 
cannot be contracted upon, both designs will yield sub-optimal investment. The optimal 
design is then that which is less inefficient. 

To simplify further the study, we assume equal bargaining power so that the quasi- 
rents 6 J = n-n b - n{ are evenly split among the parties for design j = i,n. The buyer's 
ex-ante profit is thus, 

< = *i + S*'-2*J = \[n^n j b -n{-k 2 b ) 

ex a + a b - a s + \ji b + p J b - ji) k b -k 2 b + (^ - p ] s + j J b ) k s 

for design j = i, n. The optimal investment for the buyer is easily characterized as 

k i = \ [fb + Pb~ Ys] for desi £ n J = f > n ( 14 - 2 > 

The formula for the seller is entirely symmetrical. 

Since there is always ex-post cooperation, the total welfare achieved under design 

j = i,n is 



The likelihood of vertical integration is seen though the changes in A = W l - W n in- 
duced by changes in the parameters. For instance, what is the change due to an increase 
in the intrisic value of the buyer's human capital (jj. b ): 



dA 
d[i b 



K - K + (mi, - K) Jg - [» b - *») g = I (4 - *») (14.3) 



Define the selfish value of investment as the difference between own value f5 3 b and 
the value for the partner y{ . One would intuitively assume that the buyer's investment 
has more selfish value under integration than under independence. If this is so, then 
fi l b~Y\ > /^ _ y" (check with (14.2)) so that the buyer invests more under integration than 
under independence. We can now interpret (14.3): an increase in the intrisic value of 
the buyer's human capital \i b makes integration more frequent. Likewise, if the seller 
invests more when he remains independent (k" > k l s ), an increase in the intrisic value of 
his human capital fi s makes integration less frequent since Jj- = ^{k l s - kg). As the PRT 
claims, integration should occur if one's investment becomes more crucial to generate 
value. 

State Owned Enterprise 

Schmitz (2000) studies the optimal ownership of an asset such as a network used for the 
provision of a public service. When the State is the owner one speak of public provision 
while privatization refers to private ownership and a delegation to provide the service. 
Lastly, joint ownership or Public-Private-Partnership (PPP) refers to the situation where 
both parties hold veto power over the use of the asset. 

Privatization of a public enterprise changes the manager's objective towards profit 
maximization so that cost cutting takes right of way; this is conducive of more efficiency 
yet too much cost cutting tends to deteriorate the quality of the service so that a negative 
externality is imposed. Under public provision, the State can design service innovations 
and implement them at will. This is conducive of efficiency yet, too much innovation 
inflate the cost of the operator so that another negative externality is imposed. The fol- 
lowing analysis examines how the aforementioned ownership structures mitigate those 
externalities generated by a double moral hazard problem. 

The firm S, seller of the service, invests fc§ now to cut his future cost of service or 
equivalently earn an extra revenue Rsiks) though it deteriorates the objective of buyer 
B, here the government (e.g., consumer surplus) by a damage dsiks). The government 
spends k B on innovation to improve her future objective by some extra revenue i?s(A; B ) 
though it augments the agent's future cost by ds{k B ). The decision to implement an 



activity is taken ex-post by the asset owner once he has gathered all the relevant infor- 
mation. The decision maker is aware that his decision might hurt the other party and 
may be willing to negotiate ex-post an agreement. We assume that ex-post quasi-rents 
are shared equally. Let 8 s = Rs(ks) - dsiks) and 8 b = ^b(^b) - dsika) denote the ex-post 
value or quasi-rent associated with each activity. Assuming both positive, it is always 
efficient to carry on each activity so that maximum welfare is W = 8 s - ks + 8 b - k B and the 
first best levels thus solve 8' s = 1 and 8' B = 1. We shall compare the investment incentives 
under various ownership structure with the first-best. 

Under public provision, G can decide to carry on both activities but would only im- 
plement the innovation since the other one hurts him. The stake for negotiation is thus 
8s i.e., she will carry on the cost cutting activity if the agent pays her more than her 
potential loss dsiks). In terms of investment incentives, G sees only his gain Rsiks) and 
ignores the negative externality dsikg) imposed on the agent, she thus over- invests be- 
cause she is a parochial residual claimant. Regarding, cost cutting, the agent gets only 
one half of the quasi-rent 8s, he thus under-invests because he is a partial claimant of 
the global welfare. Under private provision, M can decide to carry on both activities 
but would only implement the cost cutting effort since the other one hurts him. The 
stake for negotiation is thus 8b i.e., he will carry on the innovation activity if the princi- 
pal pays her more than her potential loss dsiks). In terms of investment incentives, M 
sees only his gain Rsiks) and ignores the negative externality dsiks) imposed G, he thus 
over-invests being a parochial residual claimant. Regarding, innovation, G gets only one 
half of the quasi-rent 8b, she thus under-invests because she is a partial claimant of the 
global welfare. Under joint ownership, nothing is implemented by default since each 
party would veto the activity hurtful to him. The stake for negotiation is thus 8b + 8s and 
being shared equally among the parties, each is a partial claimant of the global welfare, 
thus under-invests. 

Privatization is the optimal ownership scheme if the cost cutting externality is lim- 
ited and innovation is unimportant. Public ownership is optimal when the innovation 
externality over the firm is limited and cost cutting is unimportant. If the externality 
are large, regulation can be optimal. 

14.4 On Transaction Cost 

As we show in Table 14.2, Coase (1937)'s study of the "make or buy" decision has several 
interpretations. The terms of the alternative are between a flexible but risky scheme 
(outsourcing, privatizing) and a safe but inflexible one (in-house, bureaucracy). We shall 
study two models of this dilemma, one emphasizing the adaption to unexpected events, 



the other the limits to contractual design. 



contracting mode (m) 


internal (z) 


external (e) 


Production 


make (in-house) 


buy (out-source) 


Hierarchical Organization 


centralized 


decentralized 


Procurement 


costs plus 


fixed price 


State public service 


public provision 


private delegation 



Table 14.2: Contracting modes 



14.4.1 Contractual Design 

Procurement refers to the situation whereby a principal hires an agent to perform a task 
or realize a project. 15 ® As mentioned in ch. 7, a contender will make both a productive 
investment to develop a good project and a power (lobby) investment to gain influence 
over the jury. If influence and capture play a role, the winner enjoys a privilege because 
he won only thanks to his personal connections with the jury members, not on account 
of objective economic qualities. 

One way to circumvent this inefficient (and unfair) outcome is to force by decree the 
standardization of the procured object over a large economic area (e.g., EU, US) to create 
a market for an homogeneous good or service that can then be served by many firms on 
price terms only (e.g., in a procurement auction). A liquid market active all year long 
will emerge and enable entry and thus price adjustment downward to the long term 
cost of procurement. The drawback of this new regulation is the rigidity of the product 
description that will never perfectly fit the need of each buyer. 

When the project is idiosyncratic and cannot be standardized, the relationship be- 
tween the principal and the agent becomes one of agency, examples of which include: 

• A contractor builds a museum on behalf of a government. 

• A computer maker hires an electronic firm to design a new graphical chip. 

• A fashion designer hires a cheap manufacturer to produce articles bearing his/her 
name. 

• A high-tech firm hires a low cost manufacturer to assemble the product it designed. 

The realization of such projects is always plagued by imponderables and uncertainty. 
For instance, 



• The original architectural design is impossible to carry on as such; a reinforcement 
of the building's infrastructure is needed. 16 ® 



• A new regulation prohibiting certain designs or heights for buildings might be 
passed after planning but before completion. 

• Archeological remains might be found when digging for the foundations, forcing a 
delay and extra cost to preserve them (whenever this is mandatory). 

• A complex electronic device is sensitive to the (variable) price of a metal component. 

When the agent is paid for the project on the basis of costs plus a bonus (denoted C+), 
the relationship is close and flexible as if the agent was a division within the principal's 
firm; this situation is akin to internal provision and characterized by authority of the 
principal over the agent. If, on the contrary, the parties sign a fixed price contract (de- 
noted FP), their relationship is at arm's length i.e., distant and inflexible ; this is akin to 
external delegation and relations are characterized by mutual agreement or bargaining. 

The "make or buy" trade-off is as follows: FP provides better ex ante incentives than 
C+ for minimizing production cost, it also avoids the excessive administrative cost typ- 
ical of an internally run operation. Yet, whenever unforeseen events force parties to 
change their original project ex-post, C+ displays a greater adaptability than FP which 
translates into a lower transaction cost of adaption. 

Ex-post Uncertainty and Opportunism 

We now study formally the development of this contractual relationship following Bajari 
and Tadelis (2001)'s model. The completed project has a value V but unexpected contin- 
gencies arise during the realization forcing the parties to modify the project if they want 
to finish it. 

The cost of adjusting the project is denoted F m for modes m = i, e. When parties are 
integrated or use a C+ contract, the agent applies the necessary modification and the 
principal pays the true adjustment cost Fj. Whenever the agent works as an external 
contractor he is supposed to support all costs. Yet, as the modification is due to events 
not contemplated by the original contract, the agent can ask for a renegotiation at which 
point he will act opportunistically 11 ® and this will generate an inefficiency in the sense 
that the adjustment cost will increase towards F e > F ; . 18@ The agent is typically less 
impatient than the principal regarding project completion; he can thus extract an addi- 
tional payment (holding-up the principal) but in the course of this haggling, time and 
resources are lost. Alternatively, his external position enables him to inflate the adjust- 
ment costs billed to the principal and divert part of this inflation. Once again, there 
is an inefficiency because some of the bill inflation corresponds to an inefficient use of 
resources such as the substitution of a competitive subcontractor by a family related 
one. Another prevalent explanation for the adjustment cost increase is the presence of 



asymmetric information among the parties; it is developed at the end of the section. To 
conclude, the weakness of arms length relationship lies in its inherent rigidity. 

Design Complexity 

To avoid paying for the modification cost F m , the principal can design a very precise 
project that contemplates many future contingencies. The quality of a design is mea- 
sured by the probability t that no modification will be needed later on. However, the 
better the design, the costlier it is; we assume that the fixed cost of a design of quality r 
is d{r) = -aln(l-T) where a is an exogenous indicator of the complexity of the project. 

Under a FP contract with price p, the agent supports all (scheduled) construction 
costs and is the residual claimant of any effort or investment to bring them down. Sup- 
pose then that the anticipated project cost is c e . The payoffs are n e = V- d{r) - (1 - j)F e - p 
and u e = p-c e . The principal chooses the price p to meet exactly the agent's participation 
constraint u e > 0, hence her profit is 

n e = V-d{T)-{l-T)F e -c e (14.4) 

Under a C+ contract, the principal pays all the expenses so that the agent does not 
care to contain cost. As shown in ch. 20 on moral hazard, such an attitude leads to 
an inflated project cost ct > c e . The payoffs are m = V - d{r) - (1 - r)Ft -ci-p and ui = p. 
Now, the price p of the C+ contract is chosen to meet exactly the agent's participation 
constraint u > so that the principal profit is 

n i = V-d(T)-a-T)F i -c i (14.5) 

Comparing (14.4) with (14.5), we see that the upside of delegation (outsourcing) is the 
lower ex-ante project cost while the downside is the greater ex-post adjustment cost. 

For the same quality of design t, the FP contract dominates the C+ contract if the 
cost savings c ; - c e from better work incentives are larger than the expected inflation of 
adjustment costs {l-T){F e -Fi); we immediately notice that this condition is true for a well 
designed project i.e., a large t. To make a more precise comparison, we must take into 
account the fact that the optimal design will differ according to which contract governs 
the relationship. For m = i,e, the optimal design solves d' = F m (cf eqs. (14.5) and (14.4)) 
and is t ot = 1 - -£- (assuming < a < Ft). We can make two observations; firstly, as F e > F t , 
a FP contract leads to choose a better design than a C+ one (for a given complexity) and 
secondly, the design quality decreases with complexity whatever the contract governing 
the relationship. Plugging the minimized design cost d{r m ) in the principal's payoff, we 



obtain n = V - a (1 + In (a)) + aln(F m ) - c m for m = i, e. The comparison of the two contracts 
becomes 19 ® 



n e > Jii o Ci - c e > a\n{F e IFi) o a< a* = 



(14.6) 



ln(F e )-ln(Fi) 

The general conclusion can be seen on Figure 14.2: as complexity increases from none 
to infinite, the optimal contract is initially FP with a decreasing quality design. At the 
threshold a*, there is a change of regime toward a C+ contract together with a drop in 
design quality; further on, design quality continues to decrease. The force of the model is 
not so much to corroborate these intuitive findings but to relate the cut-off position with 
the fundamentals of the model in equation (14.6). 



A design quality % 




m buy 



•4 make 



Figure 14.2: The "make or buy" Decision 



Discussion 

When applied to the " "make or buy" " quandary, this result tells us that an easy to define 
component will be bought on the market, whereas a complex component will be procured 
internally. For the aerospace industry, Masten (1984) shows that both a higher degree of 
specialization (specificity) and a higher level of complexity will increase the probability 
of internal procurement. For the automobile industry, Monteverde and Teece (1982) 
show that more complexity, identified by more engineering investment, will increase the 
likelihood of internal procurement. 

An obvious corollary noted by Gibbons (2005) is that the firms we observe are less 
efficient than the markets we observe, even though the firms we observe are more effi- 
cient than the markets they replaced. On Figure 14.2, the design quality of observable 
C+ contracts is worse than that of observable FP contracts although the observable C+ 
contracts fare better than the (non observed) FP alternatives. Applied to public services, 
the publicly provided services we use are of worse quality than the delegated ones we 



use although better than the privatization alternative (we do not use). As we comment 
in §2.4.3 (cf. Figure 13.3), the raw comparisons of the two modes can suffer from sample 
selection bias (cf. discussion at the end of §13.3.3). 

Asymmetric information 

The previous model entirely rests on the greater inefficiency created by an external pro- 
curement when adjusting for unexpected events. We show that asymmetric information 
is a likely cause of inefficiency. 

The fact that only the agent knows without doubt the true adjustment cost F t creates 
an asymetry of information that inflates adjustment costs in the external contracting 
mode. As we explain in §2.4.3, when the (uninformed) principal bargains with the (in- 
formed) agent over the compensation for the adjustments to the project, she can either 
under or over estimate the additional cost and thus offer a compensation lower or greater 
than the true cost supported by the agent. Alternatively, she rejects an offer that is more 
or less than the true cost. This means that a renegotiation failure occurs with probabil- 
ity a, in which case the project is not completed and its entire value is lost. Although 
we do not know the respective payoffs u R and n R of the agent and the principal after a 
sucessful renegotiation, we still know that their sum is V - F t since they bargained over 
a monetray transfer. 

The ex-ante payoffs are then n e = tV + (1 - t)(1 - a)n R - p - d[r) and u e = p - c e + (1 - 
t)(1 -a)u R . The principal chooses the price p to meet exactly the agent's participation 
constraint u > 0, hence her profit is n e = tV + (1-t)(1-oO(V - F,)- c e - d{r) which is identical 
to (14.4) whenever F e - (1 - a)F t + aV> F t . 

14.4.2 Limited liability 

From an incentives point of view, carrots and sticks are identical as they work through 
differential payments. They diverge however in the absolute base level and the condi- 
tioning event; indeed, the carrot is paid frequently as it rewards each good performance 
while the beating with the stick is applied infrequently since its role is to deter shirking. 
It thus seems clear that the stick ought to be the preferred instrument for the princi- 
pal whenever available. The problem is that for a variety of reasons, sticks are hard 
to come by with. When an individual has a limited liability i.e., scant wealth, impos- 
ing a negative payment upon him is akin to slavery; in other words, the only way to 
lower the utility of a destitute is physical or psychological harm. In market democracies, 
principles of higher order than economics prohibit exploitation or coercion in labour con- 
tracts involving private parties: the worse for an employee is thus dismissal and a mild 



shame exposure. This also means that a large and wealthy corporation might engage 
into harmful activities because although it is liable with almost no limit, its board of 
directors is not. 20@ It is for that reason that some actions deemed harmful to economic 
welfare (e.g., fraud) are punished by jail terms to deter the individuals at the command 
from perpetrating them. 

Limited liability, by restricting the use of sticks, gives rise to a new transaction cost 
insofar as the necessary rise of the base salary makes contracting dearer to the firm. 
We might however qualify this transaction cost of limited liability as an exogenous one 
since it is somehow imposed on the legal system regulating private contracting by moral 
considerations. We shall now see that there exists an endogenous transaction cost of 
limited liability resulting from an internal weakness of the legal system which is quite 
similar to the issue of rent-seeking (cf §16.3). Whenever a party to a contract, implicit 
or explicit, faces a lawsuit for wrongful doing and anticipates an expected loss e.g., a fine, 
a damage payment or a bad reputation, it has an incentive to contest the outcome, with 
a view to overturn it or to reduce the loss. Whereas "rent-seeking" firms contest a State 
awarded rent (a carrot), a "penalty-avoiding" firm contests a State punishment (a stick). 
As in the case of rent-seeking, penalty avoidance can run through legal or illegal means. 
In the first category, we find lobbying for soft laws and regulations, delaying through 
appeals or establishing the legally responsible entity in a hard to prosecute overseas 
territory. In the second category, corruption and outright violence are of the essence. 

Glaeser and Shleifer (2003) show how the legal system is optimally changed from 
a flexible to an inflexible one when limited liability becomes a more severe constraint 
i.e., when the maximum fine that can be realistically levied upon a firm shrinks (as a 
proportion of GDP or sales). Consider some activity generating a risk of accident. Safe 
firms have a constant accident probability po. Risky firms display a higher probability 
p n but can adopt a safety standard to reduce it by pi down to p n - pi (possibly lesser than 
po). The cost of complying with the safety standard is c. The available legal rules to cope 
with accidents are: 

• Laissez-faire: no responsibility whatsoever 

• Regulation: mandatory safety standard with a fine for discovered violators 

• Negligence: in case of accident, the fine applies to non safe firms 

• Liability: in case of accident, the fine applies 

The degree of "law and order" in the legal system is measured by the maximum fine 
F that will not be challenged by capture or corruption. For a risky firm under the liabil- 
ity regime, the expected fine saving of adopting the safety standard is piF while under 
negligence it is the greater p n F because once the safety standard is installed, the firm 



is never fined. Lastly, under regulation, the expected fine saving of complying is p r F 
where p r is the probability of control (audit) by authorities. We assume pi<p n < Pr- The 
first inequality means that the safety measure does not eliminate risk while the second 
means that controlling for the presence of the standard (ex-ante) is comparatively easier 
than proving responsibility in case of accident (ex-post). 

For all legal rules, a firm compares the expected fine saving to the cost c of adopting 
the safety standard. What matters is the position of the ratio £ with respect to the 
aforementioned probabilities. Clearly, when F is small, the government is powerless and 
cannot promote the adoption of the safety standard. The optimal policy is then laissez- 
faire as it avoids the administrative cost of setting up a true policy but also the wasteful 
corruption that such a policy would trigger. As F increases, the safety objective becomes 
feasible with regulation, then with negligence and lastly for F large with any of the three 
instruments. There is however a social ranking of the legal modes. 

Liability is a first best policy because it only motivates risky firms to adopt the safety 
standard, so that no safe firm is forced to waste resources in adopting a standard that 
does not reduce their (already low) riskiness. Negligence is also first best when the max- 
imum fine is intermediate with -£- < F < -£- o p n F > c> poF. This is so because the LHS 
inequality guarantee adoption by risky firms while the RHS one means that safe firms 
are not (wastefully) lead to adopt the standard. Lastly, regulation is only second-best 
as it forces safe firms to adopt the safety standard, which is a social waste of resources. 
Based on our assessment of what can be done according to the level of the fine and the 
social benefit of each policy, we conclude that 21@ 

The optimal legal system is 
Laissez-faire in powerless states (F < -£-) 

Pn 

Regulation for low level of enforcement (-f- < F < -£-) 

Pn Pr 

Negligence for intermediate level of enforcement (-^- < F < -j-) 
Liability for high level of enforcement (F > — ) 



Since the cost of adoption c is roughly proportional to firm size q, Glaeser and Shleifer 
(2003) argue that industrialization amounted to a rising q accompanied by a proportional 
rise of harm so that damages awarded by justice ought to have grown proportionally; yet 
the cost of subverting justice remained stable so that the maximum fine F remained the 
same. This evolution of the underlying parameters then made the change from liability 
to negligence and lastly to regulation optimal in the US legal system which rationalizes 
the actual evolution at the turn of the XX th century. At the same time, the US congress 



passed laws aiming at increasing F by recalling corrupt judges and by developing bu- 
reaucracy (federal commissions) which is a State apparatus more immune to influence. 
The recess of regulation over the last twenty years can also be interpreted in the light of 
this model as a decrease of the cost c of meeting the highest safety and quality standards 
and, as an increase of the maximum fine F due to the consolidation of democratic institu- 
tions free from the influence of big trusts (relative to the situation a century ago) i.e., the 
large fines imposed on corporations by anti-trust authorities or the damages awarded by 
popular juries are not disputed beyond legal wrangling. 

14.4.3 Employment vs. Performance 

Using a model of incomplete contracting under moral hazard (cf ch.20), Levin and 
Tadelis (2010) show that the employment vs. performance dichotomy regarding labour 
contracts illustrates the internal vs. external provision ("make or buy" ) as well as the 
private vs. public provision. 

A principal contracts the services of an agent to provide a service of measurable qual- 
ity q such as health of patients, educational achievement of students or satisfaction of 
customers. The agent's activity or effort has two dimensions, one extensive and con- 
tractible L akin to time spent on the job while the other one is intensive and non con- 
tractible beyond a minimum K ; it is akin to human capital or organizational design. The 
technology transforming the agent's inputs into the output is a standard Cobb-Douglas 
q = KL. We shall see later that a quality objective q can be achieved by a simple em- 
ployment contract stipulating assistance L = q/K . Writing and enforcing a contractual 
requirement is always costly but extensive effort L is obviously less difficult to monitor 
than the quality index q. 22@ We assume that the differential cost between the latter and 
the former is an increasing convex function d{q) of the final quality q desired by the prin- 
cipal. This is so because greater quality requires more and more difficult and complex to 
describe sub-objective and tasks. 

The agent's cost per unit of time is the increasing convex function c{K). We assume 
that competition drives the market wage to c{K Q ). A contract is {w,L,q) where L and q 
are minimum requirements and where the salary or fixed fee w is tuned to guarantee 
participation of the agent. We prove first a simplifying principle: it is useless to set both 
a time and a performance requirement. 

For a low quality requirement q < K L, the optimal investment is the minimal one 
K* = K since the required output is met with the required input i.e., no extra costly 
investment is needed. Setting q = would thus yield the same outcome. For a large 
quality requirement q > LK , we show that the input requirement can be removed (set 



L = 0) by considering where lies the optimal investment K*. If K*L < q then the agent 
voluntarily chooses L* > L because both K and L are costly i.e., must maintain KL = q. 
Hence, setting L = has no effect. If K*L > q then the agent would like to pick L* < L 
which means that imposing the input requirement L>L forces the agent to spend more 
extensive time in order to attain the output requirement and this is also costly for the 
principal since w must then be adjusted upward. 

We have thus shown that setting either L or q at zero can only improve the outcome 
for the principal. Observe then that L = is akin to a performance contract since only 
the output is paid for while q = is akin to an employment contract. The cost w pe of 
achieving one unit of output under a performance contract is the minimum over all pairs 
{K,L) of the agent's cost c{K)L under the constraint KL > 1 which necessarily binds at 
the optimum as we already argued; the agent thus minimizes £ ^f L i.e., choose the opti- 
mal investment K* equalizing average and marginal cost of human capital. 23 ® The idea 
here is that once delegated the service provision, the agent chooses an optimal mix of 
extensive and intensive efforts in order to produce any additional level of quality. The 
moral hazard problem is alleviated because the agent is the residual claimant of any cost 
reduction. The cost of the performance contract is thus w pe = c'{K*) = ^r-. Under an em- 
ployment contract, the cost of achieving one unit of output is simply w e ' 
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since K* minimizes the average cost. The performance contract leaves the agent free to 
choose both K and L while the employment one somehow fixes K down to its contractible 
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Figure 14.3: Employment vs. Performance 



To compute the marginal cost of quality for the principal, we must take into account 
the cost d{q) of designing a specific objective so that C m (q) = min{w em , w pe +d'{q)} as shown 
on Figure 14.3 by the continuous increasing curve. The principal gross payoff is V{q) 
which can either be a profit or the consumer surplus generated by service of quality q. 



We assume decreasing returns to quality i.e., the marginal benefit V m decreases with 
quality. Interpreting the intersection of C m with V m and an alternative V m where quality 
is more valued by end-users, we can conclude: 

I If the principal cares moderately for quality, a performance contract is optimal while 
if quality is crucial for the service, employment is the optimal scheme. 



Chapter 15 
Horizontal Integration 



Horizontal Integration deals with mergers and acquisitions (M&A) between firms in- 
volved in the same markets. We firstly presents some stylized facts on the huge size of 
firms in advanced economies and discuss the incentives to grow bigger. 

We then inquire about the efficiency effects of mergers and the resulting intromission 
of public authorities. Recall indeed that when two competitors in a given market merge, 
the competitiveness of the market is reduced and so is its productive efficiency. From the 
legal point of view, a merger strengthens or creates a dominant position which may give 
rise to abuse. 1 ® 

A related issue is the merger paradox according to which the gain in market power 
from a merger is not a sufficient reason for making the operation profitable. For instance, 
the newly formed company might use cross-subsidization or simply its bigger size to 
unlawfully increase its market power. For these reasons, mergers and acquisition are 
carefully watched by antitrust authorities which leads to legal disputes centered on what 
exactly is the relevant market, its participants and the extent of one firm's harmful 
market power. 

The last two sections deal with the practical methods used to define a market and 
measure market power. 

15.1 Merger Activity 

15.1.1 Large Firms 

We provide here some limited stylized facts, more detailed tables are available. 

Forbes magazine compiles information about the 2000 largest firms in the world since 
2003. 2@ Table 15.1 displays the distribution among countries and sectors for the period 
2004-2010. All figures are % of group total, all tables are sorted by decreasing share of 
profit. The HHI index of concentration (cf §15.3) for countries, whether computed on # of 



firms or sales, decreases from 2500 to 1500 over the period. Among sectors, concentration 
is around 600 and slightly increasing. 



Loc 


Firms 


Sales 


Profits 


US 


32.5 


34.7 


33.6 


UK 


5.9 


6.9 


7.0 


JP 


14.8 


14.1 


6.4 


FR 


3.3 


6.6 


4.9 


DE 


3.0 


6.4 


3.9 


CN 


2.8 


2.1 


3.7 


CA 


2.9 


2.1 


2.9 


RU 


1.0 


1.0 


2.8 


CH 


2.0 


2.3 


2.8 


ES 


1.6 


1.9 


2.6 



Industry 


Firms 


Sales 


Profits 


Oil & Gas 


5.5 


12.2 


16.8 


Banking 


15.8 


10.3 


16.3 


Pharma & Biotech 


2.1 


2.8 


5.8 


Food & Tobacco 


5.3 


6.7 


5.6 


Utilities 


5.9 


5.0 


5.5 


Materials 


5.7 


3.8 


5.3 


Insurance 


5.4 


7.3 


4.9 


Div. Finance 


8.1 


4.6 


4.7 


Telecom 


3.4 


4.2 


4.3 


Conglomerates 


2.1 


2.9 


3.5 



Table 15.1: Distribution of Large Firms 

Table 15.2 presents the winners and losers among countries and sectors during this 
period with the mean annual percentage change of the number of firms present in the 
Forbes 2000 list. 3@ 



Win 


% 


CN 


30.4 


LU 


15.9 


RU 


14.4 


BR 


11.1 


CL 


10.1 


IN 


9.5 


IL 


7.0 


AT 


6.6 


HK 


5.5 


BM 


5.1 



Lose 


% 


UK 


6.2 


US 


4.9 


NL 


4.8 


JP 


2.4 


DE 


2.0 


SE 


1.6 


IT 


1.4 


FI 


1.3 


CA 


1.2 


AE 


0.0 



Win 


% 


Capital Goods 


6.1 


Oil & Gas 


4.9 


Materials 


4.2 


Chemicals 


2.9 


Defense & Space 


2.5 


Div. Finance 


1.7 


Telecom 


1.0 


Trade 


0.4 


Conglomerates 


0.4 


Food & Tobacco 


0.3 



Lose 


% 


Hotels & Leisure 


9.0 


Health Care 


4.7 


Automotive 


4.3 


Retailing 


3.9 


Business Services 


3.2 


Media 


2.6 


Semiconductors 


2.4 


Technology 


1.6 


Software 


1.4 


Consumer 


1.4 



Table 15.2: Winners and Losers: Countries & Sectors 



Table 15.3 shows the average year-to-year rotation over the period and the proportion 
of firms who have stayed in the entire period in a given tier. The first two columns of 
Table 15.4 presents, among firms always present in the list, those who have climbed and 
tumbled most the ranking over the period. 4@ For instance ATT climbed from the 400's to 
the top-ten while Citigroup fell from the top spot to the 400's. The impact of the financial 
crisis is most apparent for losers while telecommunication and energy firms are those 
most profiting. Regarding entry and exit, the last columns show the over and under 
achiever. Most exits are due to M&A such as SBC buying ATT in 2005, then rebranding 
as ATT and buying Bellsouth in 2006. Bank One was bought by JP Morgan in 2004. 

In a different tack, Table 15.5 selects, over the period, the largest gains and losses and 
then sums these among firms in order to display the over and underachievers (in mean 



bn$ per year). Vodafone is noticeable for appearing both as a large loss maker and a 
high climber of the ranking. Although it became profitable lately, it has always retained 
a large market capitalization (investors trusted it would become profitable). Table 15.6 
displays which firms can be deemed the "big players". 



Top 


Rotation 


Permanence 


500 


75 


21 


1000 


52 


38 


1500 


30 


51 


2000 


12 


58 



Table 15.3: Movement within the list 



Climbing 



ATT 



Suez 



Telefonica 



Apple @ 



Vodafone 



Xstrata 



France Tel 



Vale 



America Movil 



Generali 



Tumbling 



Citigroup 



American Int 



Fannie Mae 



Freddie Mac 



ING@ 



RBS@ 



UBS® 



Toyota 



Daimler 



Altria 



Entry 



ICBC 



China Cons Bk 



Bof China 



EDF 



Inbev ' 



Rosneft @ 



Itausa 



Bof Comu 



Kraft Foods 



Saudi Basic Ind 



Table 15.4: Moving on the ladder 



Gain 


bn$ 


ExxonMobil @ 


32.4 


Shell @ 


19.9 


BP@ 


18.4 


General Elec @ 


17.1 


Chevron @ 


14.0 


Total @ 


13.4 


Microsoft @ 


13.4 


Gazprom @ 


12.1 


Petrochina @ 


12.0 


WalMart @ 


11.7 



Exit 



SBC@ 



HBOS 



Wachovia @ 



Merrill Lynch 



ABN Amro 



BkOne 



Washington Mut @ 



Bellsouth 



Fleetboston Fin 



Lehman Bro 



Loss 


bn$ 


Fannie Mae @ 


-18.9 


American Int @ 


-15.7 


Vodafone @ 


-13.6 


General Motors @ 


-11.7 


Freddie Mac @ 


-10.7 


RBS@ 


-5.9 


Fortis @ 


-5.6 


Nextel @ 


-5.0 


Viacom @ 


-4.6 


Ual@ 


-4.3 



Table 15.5: Average maximum Gains and Losses 



Table 15.8 displays employment over the last five years using the Fortune Global 500 
list of public companies. 5 ® Table 15.9 pictures the returns per employee 6 ® which are 
overtly dominated by mineral resources companies with exceptions from GoldmanSachs, 
Google, Microsoft or Apple as can be seen from Table 15.9 which averages performance 
over 2008-2010: left panel for large employers and right panel for mid-sized ones. 



Revenue 


bn$ 


WalMart @ 


342 


ExxonMobil @ 


316 


Shell @ 


302 


BP@ 


273 


Toyota @ 


252 


Chevron @ 


179 


Total @ 


167 


Daimler @ 


159 


General Elec @ 


159 


Ford Motor @ 


159 



Asset 


bn$ 


RBS@ 


1996 


Bnp Paribas @ 


1919 


Barclays @ 


1848 


HSBC@ 


1736 


Citigroup @ 


1730 


Deutsche Bk @ 


1572 


UBS@ 


1495 


ING@ 


1481 


Bof America @ 


1479 


JP Morgan @ 


1464 



Stock 


bn$ 


ExxonMobil @ 


366 


General Elec @ 


273 


Microsoft @ 


252 


Petrochina @ 


219 


WalMart @ 


207 


Shell @ 


189 


BP@ 


172 


Johnson&Johnson @ 


171 


Procter&Gamble @ 


170 


Pfizer @ 


164 



Table 15.6: 2004-2010 Mean Revenue, Assets and Stock value 



Firm 


Rank 


General Elec @ 


1 


ExxonMobil @ 


2 


HSBC@ 


3 


Shell @ 


4 


BP@ 


5 


Bof America @ 


6 


JP Morgan @ 


7 


WalMart @ 


8 


Berkshire @ 


9 


Total @ 


10 



Firm 


Rank 


Bnp Paribas @ 


11 


Chevron @ 


12 


Santander @ 


13 


IBM® 


14 


Verizon Comu @ 


15 


Wells Fargo @ 


16 


Samsung @ 


17 


Barclays @ 


18 


Eni@ 


19 


Procter&Gamble @ 


20 



Firm 


Rank 


NTT@ 


21 


Pfizer @ 


22 


Petrochina @ 


23 


Nestle @ 


24 


Goldman Sachs @ 


25 


Axa @ 


26 


Siemens @ 


27 


Microsoft @ 


28 


Toyota @ 


29 


Deutsche Bk @ 


30 



Table 15.7: Average Rankings 



Apart from the Chinese newcomers, most firms in the previous tables are US or Eu- 
ropean. The rest of Asia also host very large undertakings but they are organized in a 
different manner. Japan is the paradigmatic example. When the country opened up to 
the western world during the XIX th century, family-controlled vertical conglomerates(cf 
zaibatsu) came to dominate the economy. During the post-1945 economic recovery, six 
major corporate groups (cf Keiretsu) emerged: Mitsui, Mitsubishi and Sumitomo, Fuyo, 
Sanwa and Dai-ichi Kangyo. Each is led by a bank providing finance and a general trad- 
ing company (cf. sogo shosha) coordinating business deals. Being organized as a trust, a 
keiretsu is immune to hostile takeover. Although their weight in the economy is decreas- 
ing, JFTC (2001) reports that they still occupy 13% of the total capital of the Japanese 
companies and make 11% of total sales. Other important (and newer) Keiretsus are ver- 
tically integrated. In the Automobile industry we find Toyota, Nissan, Honda, Daihatsu, 
Isuzu while in the electronics industry we find Hitachi, Toshiba, Sanyo, Matsushita and 
Sony. 



Firms 


Emp. 


Wal Mart Sto @ 


1991 


State Grid @ 


1381 


China Nat Petro @ 


1313 


US Postal @ 


764 


Sinopec @ 


664 


Carrefour @ 


472 


Deutsche Post @ 


463 


Agri Bk of China @ 


452 


Hon Hai @ 


448 


China Tel @ 


439 



Firms 


Emp. 


Siemens @ 


430 


Gazprom @ 


421 


United Parcel @ 


419 


McDonald's @ 


417 


Compass @ 


391 


Hitachi @ 


378 


ICBC@ 


374 


IBM® 


374 


Tesco @ 


354 


Target @ 


352 



Table 15.8: Top Employers (in thousand) 



Large Firms 


7r/Emp. 


Petronas @ 


376 


Exxon Mobil @ 


358 


Goldman Sachs @ 


292 


BHP Billiton @ 


277 


Chevron @ 


266 


Shell @ 


231 


BP@ 


222 


Statoil @ 


210 


Petrobras @ 


210 


Microsoft @ 


180 



Small Firms 


s/Emp. 


EnCana @ 


923 


Occidental Petro @ 


587 


Petro Canada @ 


417 


CNP Assurances @ 


346 


GasTerra @ 


283 


PTT@ 


247 


Formosa @ 


231 


Murphy Oil @ 


199 


SABIC @ 


196 


Legal & General @ 


192 



Table 15.9: Top Profits in 1000$ per Employee 



15.1.2 Reasons to merge 

As we shall see later on with the merger paradox, mergers are hardly motivated by 
the desire to increase market share or market power when competition is driven by 
production capacities. In chapter 24 on standards and components, we explore some of 
the technological motives for mergers; a small list is: 

• Fixed cost elimination 

• Economies of scale and scope (synergies, downsizing the workforce) 

• Network effects (complementarity of products or imposing a standard) 

Holmstrom and Tirole (1989) explore an altogether different driver for mergers which 
is based on managerial incentives (cf§23.3) and the desire to expand the size of the 
business they manage. Among the reasons why managers tend to oversize the firm they 
run, we find 

• The technology may be such that the effort of the manager and the size of his staff 
are complementary, hence staff inflation frees the manager from stress and hard- 
work. 



• Staff size may influence the market perception of the manager's ability. A powerful 
king is always followed by many courtesans, thus the display of courtesans might 
signal power. 

• A larger staff makes it more costly to fire the manager since they all have to go 
with him (being inefficiently numerous). 

• Increasing the firm's size tend to relax competitive pressure and it is often a strate- 
gic move used to prevent entry (cf Part 10). 

• Whenever managerial compensation is linked to the firm's market share there is 
an incentive to grow bigger as shown in §6.4. 

• A bigger staff contains more high ranking level jobs which motivates subordinates 
(middle managers) longing for promotions. Inversely, rewards through promotion 
rather than year-to-year bonuses force the firm to grow in order to supply the new 
positions that such promotion-based reward systems require. 

To grow bigger a firm has two basic options, internal and external growth. The former 
means expanding sales using its current assets (capital and employees) to conquer new 
markets or increase market shares in core markets. The latter method consists in ac- 
quiring or merging with another company, a way chosen by a majority of the companies 
listed in the above tables. 

The creation of large unified commercial areas in Europe or North-America has trig- 
gered a wave of mergers and acquisitions over the last 20 years. Harford (2005) examines 
the causes and timing of industry-level merger waves; his statistical analysis reject a 
standard explanation, that mergers are triggered by (too) high stock market valuations. 
Rather, he concludes that industry merger waves occur in response to economic, regula- 
tory or technological shocks that require large scale reallocation of assets. However, a 
sufficient liquidity in capital markets is necessary for the wave to actually take place, 
in order that transaction cost do not hinder the fundamental motives of mergers. This 
condition causes industry merger waves to cluster in time even if industry shocks do not. 
The following alphabetical listing of cases and their origin is Harford (2005)'s table 2. 



Aircraft (1999): Big, older fleets require increased maintenance, repair and overhaul. In- 
creasingly outsourced from carriers, who want "one-stop shops". 
Banking (1985): Deregulation allows interstate banking, particularly in California. 
Banking (1996): Deregulation and Information Technology. 
Business Services (1986): Partially IT-driven mergers as IT becomes important. 



Business Services (1998): Fragmented, smaller players combine, share cost structures, offer 
more complete line of services to customers; industry grows as outsourcing takes off. 

Business Supplies (1997): Paper and pulp industry consolidates from fragmented price tak- 
ers to gain market power and avoid costly duplication of capital intensive production 
facilities. 

Candy & Soda (1992): Snapple and other non-carbonated beverages make strides, leading 
to activity to beat or buy them. 

Chemicals (1995): Large cash flows, over capacity in production, need to consolidate re- 
search. 

Communication (1987): Break-up of AT&T in 84 was followed by entry into long distance, 
investment in fiber optic capacity, etc. 

Communication (1997): Telecommunications Act in 1996, consolidation, technological 
changes. 

Computers (1998): Internet 

Consumer Goods (1986): Mature market and the need to offer full line leads to consolidation. 

Electrical Equipment (1986): Several companies seek growth through acquisition to compete 
better with industry leaders Westinghouse and General Electric. 

Electronic Equipment (1999): OEM's growth leads to demand for electronic equipment man- 
ufacturers to shift from small regional players to larger global players capable of infras- 
tructure, IT, etc. to grow with their customers. 

Entertainment (1987): Deregulation allows firms to own many stations. 

Entertainment (1998): Studios seek diversified production sources and strong libraries. 
Telecom act of 1996 relaxes media ownership limits. 

Food Products (1999): Retail consolidation pushes distribution consolidation and or sale of 
distributors to bigger retailers who want to buy rather than build distribution channels. 

Healthcare (1996): Service providers consolidate to have bargaining power with HMOs. 

Insurance (1998): Bigger is safer, leading to consolidation, especially in reinsurers. 

Machinery (1996): Large manufacturers decreased number of suppliers they were willing to 
deal with in bid to improve efficiency. This forced consolidation in a number of capital 
goods industries; many smaller players were bought in "roll-up" deals. 

Measuring and Control Equip (1998): Depression in semi-conductor industry (big customer). 

Medical Equipment (1998): Two motives: first, acquisitions in core areas to grow, then acqui- 
sitions outside core areas to offer broad products to increasingly consolidated customer 
base (hospitals). 

Personal Services (1996): Consolidation in legal and funeral services industries. 

Petroleum and Natural Gas (1997): Increasing prices, record drilling, increasing costs lead 
drive to increase size to be more efficient. 



Pharmaceutical Products (1998): Mid-sized companies merge to garner size necessary to 
fund increasingly large costs of development. 

Restaurants, Hotels, Motels (1985): Saturation and similarity, trends toward take-out, com- 
petition from supermarket delis. 

Restaurants, Hotels, Motels (1996): Operators such as Starwood have buying sprees. Others 
buy properties to gain sufficient bulk to compete in corporate account business market. 

Retail (1986): Shift to specialty stores as aging department stores consolidated; value of 
land and buildings in revitalized urban centers. 

Retail (1996): Strong growth and impact of internet Shipbuilding, 

Railroad Equip (1998): Shrinking defense budgets finally forced the issue of overcapacity in 
the industry. 

Steel Works (1997): Collapse in demand from Asia leads to falling prices forcing consolida- 
tion. 

Transportation (1986): Mostly still working-out issues following deregulation. 

Transportation (1997): End of Interstate Commerce Commission, overcapacity in shipping, 
open-skies agreements, railroad consolidation started with a few big mergers and then 
forced responses to balance. 

Utilities (1997): Deregulation in some markets plus elimination of a law prohibiting mergers 
between non-contiguous providers. 

Wholesale (1996): Simultaneous consolidation in several wholesale sectors as growth slows 
and firms move to add breadth, take advantage of new IT ability, grow by acquisition. 



Table 15.10: Merger Waves in the US 



15.1.3 Merger Scrutiny 

The report EC (2001) on merger and acquisitions (M&A) of 2001 ascertains between 9000 
and 17000 yearly operations involving a European firm over a ten years period. The UK 
alone accounts for one third of all activity followed by Germany (16%), France (13%), the 
Netherlands (6%) and Italy (6%). The figures for the US are slightly superior and are 
ahead the European trend by 2 years as can be seen on Figure 15.1. 

Each year, more than 15000 EU firms exchange over 5% of their titles with other 
firms (for an amount of at least 1 M€). There is a merger or an acquisition whenever a 
majority of ownership titles is involved. According to EU law, a merger has an effect on 
trade between member states if the aggregate worldwide turnover of all the undertak- 
ings concerned is more than 2.5 bn€; in that case the operation must be notified to the 
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Figure 15.1: M&A in Europe and the USA 



EC. Figure 15.2 presents the trend in merger notifications to the EC since the inception 
of the Merger Regulation in 1989 (cf. latest data). Over a total of 3368 notifications i.e., 
190 per year, 3% are withdrawn, 90% are declared out of scope or compatible, 3% lead 
merging firms to apply remedies and finally in 19 cases (a mere 0.6%) was the proposal 
prohibited. 

When competitiveness is endangered by a proposed merger or acquisition, the EC 
asks the parties to apply remedies for instance a commitment to exit from a joint- 
venture, granting access to an infrastructure or technology, terminating an exclusive 
agreement or transferring a market position (divesture). The latter instrument, by far 
the most frequent, intends to find a suitable purchaser who would be able to use the 
transferred asset to exercise a sufficient competitive force on the merging parties. For 
instance, to accept the Total-Fina acquisition of Elf, the EC required the new company 
to sell hundreds of highway gas stations in France and Spain. Similarly, the merger of 
two large tissue and diaper manufacturers, Kimberley-Clark and Scott Paper in 1996 
was accepted by US authorities upon their agreement to divest many assets including 
the second company's brand name "Scotties". A recent ex-post review of the EC regard- 
ing these remedies shows a frequent inadequate scope of the divested business e.g., the 
omission of key assets that were necessary for the viability and competitiveness of the 
divested business. 

The scale of notifications of mergers and acquisitions to the FTC in the US is about 
tenfold that of the EU in part because the threshold for compulsory notification is a 
smaller 50m$. Bergman et al. (2009) compares the EU and US merger policies. Focusing 
on dominant-firm mergers, they find that EU is tougher than the US on average and on 
mergers resulting in low market shares. Also, US policy is more affected than EU policy 
by a range of market considerations. 
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Figure 15.2: European Merger Notifications 

15.2 Merger Paradox 

Absent scale economies or issues of cost reductions, a merger always seems profitable 
given that the new firm has more market power; indeed, it can internalize the previously 
destructive behavior of its former members against each other and thereby achieve a 
higher total profit than by acting independently. This intuition is actually true but only 
temporally because it needs to be driven to its logical limit: if the merging firms change 
their behavior then the non merging ones will also react to this change. The question is 
then, is this good or bad for the insiders? The answer depends on whether competition 
is driven by prices or quantities. 



15.2.1 Cournot Competition 

A merger between firms acting in an homogeneous goods market is viable only if their 
absolute share of industry profits is bigger after the merger. Salant et al. (1983) show 
that this is very unlikely to happen in a Cournot setting, thereby contradicting the in- 
tuition presented above. The explanation of this paradox has to do with the chain of 
reactions created by the merger. 

In a Cournot setting where firms compete in quantities, merging firms (insiders) im- 
mediately reduce their production to internalize their new interdependency; this ob- 
viously increases their total profit. Yet, since quantities are strategic substitutes, the 
non merging firms (outsiders) react by expanding their output which hurts the merger's 
profit although he reacts with a second reduction of output. That will again trigger an 
expansion from the outsiders and a further loss of profit for the merger. This process 



of action-reaction will continue until a new post-merger equilibrium is found; it surpris- 
ingly happens that the merger's profit is less than the pre-merger profit of the insiders. 

This can be checked within the simple model of Cournot competition seen in §5.1.3. 
Consider n plants with the same constant returns to scale (CRS) technology that are ini- 
tially owned by n different firms competing in quantities; in equilibrium, the individual 
quantity produced by each firm is 7@ q n = g ^- and profit is % n - \q\. Whenever a merger 
occurs, the CRS hypothesis makes the new owner indifferent on how to distribute pro- 
duction among plants (cf §2.1.3 for the general multi-plant treatment). We therefore 
assume a centralized behavior: the merging firms act as a single firm which means that 
market concentration is reduced. 

In this context, a merger among k + 1 firms leaves only n-k independent firms in the 
new market configuration. The merger is viable only if the profit of the conglomerate 
7i n -ic is larger than the sum of profits of the merging entities {k+l)n n . This condition can 
also be written from the point of view of an acquirer: it is worthwhile to buy k firms only 
if the profit increase n n -k-n n due to reduced market competitiveness is greater than 
the acquisition cost. The latter is kn n given that each original owner must renounce to 
n n . In our example, the ratio (fl ~^ c) appearing in profits can be eliminated so that the 
feasibility condition reduces to 

^_^ > ^±1 ^ k > 2n + l-V4^ 5 A s k+l h s 3-^T5 

(n-fc+1) 2 (ra+1) 2 2 n 

It is easy to check that the proportion of firms participating in the merger h{n) reaches a 
minimum of | for n = 5, hence 

Merger paradox: In a market for homogeneous good where firms compete in quantities, 
no merger with less than 80% of the initial members of the industry will ever take 
place. 

Practically, this means that the only case where two firms find it profitable to merge 
is when they already control the whole market (duopoly). Reversing the formal result 
we have just obtained enables to rationalize a frequently observed conduct: a firm buys 
another one but leaves it operating as an independent subsidiary although their brands 
are competitors. Indeed, we have just seen that a full merger with integration of sales 
forces reduces the overall profit, thus it is better to keep the sales teams as independent 
competitors and streamline the production side (costs). Such a decentralized behavior is 
achievable, say, by instructing or motivating the managers of the subsidiaries to operate 
so as to maximize their individual firm profits. 8 ® 

In the automotive industry we find the case of PSA with the brands Peugeot and 
Citroen who sell very similar cars, 9@ Fiat with the quality differentiated brands Fiat, 



Lancia and Alfa-Romeo or VW AG with the horizontally and vertically differentiated 
brands Volkswagen, Audi, Skoda and Seat. This behavior from the parent companies can 
be profitable: although their subsidiaries compete against themselves they also compete 
against rivals so that the conglomerate gains market shares. Notice that when VW 
bought the Spanish firm Seat and the Slovak firm Skoda to market them in the entire 
European market, it increased overall European competition, thus reduced the cake, but 
succeeded to increase its own share in absolute value. 10@ 

15.2.2 Bertrand Competition 

For markets where firms compete in price over horizontally differentiated goods, David- 
son and Deneckere (1985) confirm that mergers increase market power under the condi- 
tion that merging firms are not able to alter product's characteristics. In that situation, 
any merger is beneficial for all firms, merging and non merging ones alike; also, the 
bigger the merger, the larger the benefit. 

To prove this result, we use the demand D{p t ,pj) = a-bpi - dpj seen for duopoly in 
equation (5.18) of §5.2.3 and extend it to oligopoly replacing the competitor's price pj 
by the average price of all other firms p-t = -^T.j^i Pj- The profit of a single firm i is 
Tti = {pi - c)D{pi,p-i) and the profit of a k firms merger (involving the first k labels) is 
n fc = Y.j^k 71 ]- The FOC for the price of an insider in the post-merger equilibrium can be 
decomposed into the direct effect of her price on her own profit plus the indirect effect of 
her price on the profit of the other merging firms; formally 

dU k dm ^ djtj 

o = — = — - + Y — - 

dpt dpi m dpi 

= a + bc-2bpi + dp-i + {pi- tfdAk (15.1) 

where X k = ^j-. Notice that for k = 1, we have the FOC of an insider in the pre-merger 
equilibrium or the FOC of an outsider in both situations. 

Since the indirect effect is positive for an insider within a merger (A k > 0), it motivates 
the insider to raise his price beyond what was optimal when he was an independent 
firm (Afc = 0). We deduce that any merger increases the best reply of any insider. Thus, 
starting from the pre-merger equilibrium p* , all insiders will increase their price so that 
all outsiders, in reaction, will increase their own (but less). Then, insiders react to the 
outsiders move and increase again their price generating a lesser raise by the outsiders. 
This process eventually converges to a post-merger equilibrium where all insiders play 
r and all outsiders play s satisfying r>s>p*. 

We illustrate this process on Figure 15.3. We use equation (15.1) with k = 1 for an 



outsider j and the fact that p-j = kr+{ "_ k 1]s = Ajt+ir + (1 - A fc+1 )s to deduce 



2bs 



a + cb + d (A fc +i r + (1 - A /c+ i) s) 
a + cb + dXjc+ir 

2b-d(l-X k+1 ) 



(15.2) 



Next, we use (15.1) with k > 1 for an insider i and the fact that p-t = X k r + (1 - X k )p to 
deduce 



[2b-X k d)r 



a+ {b- X k d)c + d{X k r + (1- A^)s) 



o r = ¥*(s) 



a+{b- X k d) c + (1 - Ajt)ds 
2fo-2Ajtrf 



(15.3) 



The pre-merger equilibrium is p* = |g% at the intersection of O and X F 1 while the post- 
merger one is at the intersection of O and x ¥ k and has the announced characteristics 
since A; > 1 implies X k > and x i' k >W 1 . 
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Figure 15.3: Merger Profitability under Price Competition 

To understand this result, we first have to recall that in a framework of Bertrand 
competition, a price increase by any firm is beneficial to all others. This, in turn, means 
that when some firms merge into a trust, each has a greater incentive to raise her price 
because her interest is now the trust's profit instead of her own. Indeed, a price incre- 
ment has now two effects. Firstly, it participates directly to the firm's profit, thus to the 
trust benefits. Secondly, it raises the profits of the other trust members. What we have 
shown is that after a merger, all trust members increase their price so that all outsiders, 
in reaction, will increase their own (but less). Then, trust members will again react to 
the outsiders move and increase again their price generating a lesser raise by the out- 
siders. This process eventually converges to a post-merger equilibrium where all trust 
members propose a price greater than that of outsiders which is itself greater than the 



pre-merger common equilibrium price. 

15.2.3 Efficiencies and Welfare 

Discussion 

Two doctrines of economic competition rival for leadership. The Structure Conduct Per- 
formance (SCP) paradigm focuses on allocative or static efficiency. It studies how firms 
compete simultaneously in the current market; its policy aim is to create workable com- 
petition in today's market. Building on Schumpeter (1942)'s "creative destruction", the 
Chicago critic holds a dynamic efficiency view; it studies how firms compete sequentially 
for the market and emphasizes innovation. A temporary monopoly that enables the in- 
novator to recoup its investment is thus seen as a necessary evil on the path towards the 
higher goal of long term progress. 

According to tenants of the SCP paradigm, mergers and acquisitions increase firms' 
market power which leads to higher prices and hurts consumers. On that ground, signif- 
icant mergers ought to be opposed. For long, antitrust practitioners were in agreement 
with this view and sought to protect consumers from monopolization. The Chicago school 
responds from two angles. The "efficiency defense" started by Williamson (1968) broad- 
ens the SCP's vision in asserting that mergers can contribute positively to welfare by 
bringing in efficiencies. This author also argues that the adequate criteria for antitrust 
authorities is (total) welfare, not consumer surplus. More radically, Demsetz (1973) con- 
tends that the SCP confuses correlation and causation. True, there is a positive corre- 
lation between market concentration and industry profitability but which one drives the 
other cannot be identified with a comparative statics exercise, the very tool used by the 
SCP paradigm. The Chicago schools then endorses a reverse causal chain: the more 
efficient (innovative) firms capture greater market shares, earn more and tend to buy 
out the less efficient firms (or drive them out of the market). The policy implication with 
respect to mergers is also reversed: allowing innovative firms to acquire obsolete ones 
promotes efficiency and ultimately welfare. 

Market Power and Cost Asymmetry 

Cournot (1838) introduces the standard model of quantity competition in his chapter 7 
and observes that a firm with low marginal cost produces more than higher-cost firms, 
and that some of the latter might be forced to exit. He also notices that a given total 
industry output would be produced at higher cost by competing asymmetric producers 
than if a monopolist made their production decisions (because their marginal costs are 



not equal at equilibrium). 

Williamson (1968) exploits this intuition and shows that if a merger generates a syn- 
ergy (i.e., marginal cost reduction) then the welfare loss due to the price increase may be 
compensated by the cost saving. Although the original model is crude, the intuition is so 
strong that it remains robust to generalization (cf next §). 

Cowling and Waterson (1976) show in a Cournotian model that the average profit- 
revenue ratio is equal to the concentration-elasticity ratio. Although no causality can be 
deduced from this formula, it has been adopted as a foundation for the "market power" 
rationale. Clarke and Davies (1982) further show that concentration increases with the 
variance of firms' marginal costs; this proves that concentration is greater when some 
firms have a cost advantage, a result that lands support to the dynamic efficiency ratio- 
nale. In the same vein, Salant and Shaffer (1999) show that if the average marginal cost 
in the industry is constant, then so are the aggregate output and the consumer surplus. 
When a shock makes such an industry more cost-asymmetric, concentration, aggregate 
profit and welfare all increase together. 11@ This finding provides a rationale for govern- 
ment support of "national champions" at the expense of other domestic firms with the 
same initial technology. 

Let us develop formally those properties of the Cournot equilibrium under asymmet- 
ric cost. There are n active firms with constant marginal cost c\ for i < n. We denote 
Q = Y,i<n tfi the aggregate output, e = -^ the elasticity of demand, H = E ; <« (^ the HHI 
concentration index (cf. §15.3), ni = qt(p - ct) the individual profit, n = Z;<„7T; the aggre- 
gate profit and c = ^'LisnCi the average marginal cost. 

The FOC of profit maximization is 

P(Q) + qtP'lQ) = Ci ^p-Ci = ^ (15.4) 

eQ 

thus 

„ = „„_„,, = £2 (!)*=, n^H (15.5) 

which is the Cowling and Waterson (1976) formula. FOC (15.4) also reads ^ = e^^-, thus 



H = £ 2^(P_^\ ^^JL = Y j ( p -c + c-c i ) 2 = n(p-c) 2 + na 2 c (15.6) 

i<n * V I e i<n 

where a 2 c is the variance of the sample of marginal costs. Observe now that summing 



(15.4), we obtain n{p -c) = j. Plugging in (15.6), we get ^- = -^ + n °~ 2 c , thus 



1 ne'er' 1 (1-ner ? 

H=- + — -£ = - + - -v 2 c (15.7) 

n p l n n 

where v c is the coefficient of variation. This is the Clarke and Davies (1982) formula. 
Salant and Shaffer (1999) observe that summing the left version of (15.4), one gets P{Q) + 
};QP'{Q) = c i.e., aggregate output Q and consumer surplus S depend on c only. Now, by 
combining (15.5) and (15.7), we get ^§n = \ + {l-ne) 2 v c . When c is constant, so are Q,p,e, 
thus welfare W = U + S increases with dispersion of technologies. The limit is reached 
when a maximum number of firms are driven out of the market (or at least lose their 
economic rent) while the remaining ones achieve zero marginal cost, a result in the line 
of Cournot's observation regarding industry cost. 

Merger Synergies 

Let us start with a typology of merger efficiencies. One speaks of rationalization when 
production is reshuffled among plants to lower variable cost or when knowledge diffuses 
to bring all plants towards the production frontier. 12@ Scale economies occur when the 
merged firm reaches the minimum efficient scale or saves on fixed cost duplication (but 
remain on the same production frontier). Lastly, there is a synergy when the merged 
firm succeeds to innovate i.e., do something impossible to achieve unilaterally e.g. cut 
slack in the organization, combine complementary hard-to-trade assets or achieve faster 
a R&D discovery. 

Williamson (1968)'s argument relies on synergies brought about by the merger. Con- 
sider a Bertrand duopoly among two identical firms; the price equilibrium is at their 
common marginal cost p* = c. The efficiency occurs as follows: by merging, the firms are 
able to improve their technology and reduce their marginal cost by 6. Being a monopoly, 
the new firm sets p" = !±f* and earns *« = (1=$*)' whereas the change in consumer 
surplus is 

AW D = - l -{p M -p*){q* + q M ) = - l -{3-3c + 8){l-c-8) 

2 o 

As /S.W = AW D + n M , welfare improves if 

1 — c — 6 

n M > -AW D o (1 - cf < 35 (2(1 -c) + 5)o —— < 3 + 2^3 o > 0.15 

6 l-c 

i.e., the cost reduction is greater than 15% of the original net WTP. 

Farrell and Shapiro (1990) show that one cannot preclude concentration, price and 
welfare from growing together i.e., the profit increase generated by efficiencies may offset 



the "dead weight" loss. Yet, they show that without synergies or scale economies, the 
market price is bound to rise so that consumers suffer from a merger. 

The first result is rather simple to demonstrate. The profit of firm i is %i = pqt - CiiqO 
while the consumer surplus Woiq) = Jq P{x)dx- qP{q) (cf. eq. (2.19), thus welfare reads 

W = W D (q) + £ * t = [ P{x)dx- £ Ciiqt) 

and the welfare change due to a change in output decisions is 

dW = P{q)dq - £ C mii dqi = £ (P{q) - C m ,i)dqi = -P'{q) £ qtdqi 

i>\ i>\ i>\ 

by (5.11). Since H = £;>i f^) , we have 

Li>i ciidqi = \d [E ; ->i q\\ = \d [q 2 H] = qHdq + \q 2 dH 

so that dW = -Hq 2 P'{q)\-9- + \^jf\ and we can have the unusual situation where the 
market output shrinks (dq < 0), concentration increases (dH > 0), yet welfare still rises 
(dW > 0) because a large chunk of production has been moved from inefficient to efficient 
firms. 

The second result is that absent synergies, total output contracts after a merger. 13 ® 
We show it when marginal cost are constant. Letting w.l.o.g. C\ < c 2 < ■■ < c n , we saw with 
eq. (5.12) that total output is q = -^{a-bc) where c = ^L;>i Cj is the average industry 
cost. 14@ Upon merging firm i with any better one, the new owner will use exclusively 
the more efficient technology i.e., shuts down the obsolete plant. We thus have index i 
disappear from the list of active firms, hence the total output becomes q = ^{a-bc-O 
where c_ ( - = ^zj-Ljvj Cj. We have 

n{n + l){q-q) = a-b{n 2 c-{n 2 -l)c_j) = a-b[n'£ d j>iCj-(n+l)Y.i?iCj) 
= a-b{nCi-E]^iCj) = {n + l)qi by (5.14) 

thus total output shrinks even if the worst technology is bought by the best one. As 
claimed, price rises and consumer surplus falls. 

Rationalization 

As shown in Boccard (2009), a simple rationalization can be welfare improving. Consider 
the demand D{p) = 1 - p, n "cutting edge" firms with marginal cost C\ and n "obsolete" 
firms with marginal cost c 2 = C\ + 5. We let Q* = a - c\ and Ql = Ql - 6 be the efficient 



(competitive) market outputs under the two technologies. The technological gap is the 
dimensionless ratio y = 2 *. 2 = -^- measures the strength of the cost advantage. 

Let us first characterize the equilibrium in the Cournot game of quantity competition. 
The FOCs of profit maximization are 2q\ -a- bc\ -{n-\)q\- nq 2 and 2q 2 -a- bc 2 - (n - 
l)q 2 -nq\ so that the equilibrium is q\ - a ~ 2 lX\ " an ^ ^ 2 = " 2n+i ~ wnere we notice that 
q\-q 2 = 8. Obsolete firms participate only ifq 2 >0oa-c 2 >6noy<^, a. condition which 
we shall assume to hold. 

Upon merging an efficient firm to an obsolete one, we still have n efficient firms but 
n- 1 obsolete ones since the new owner will use exclusively the efficient technology i.e., 
shuts down the obsolete plant. The output changes between the old and new equilibrium 
computed using (15.2.3) are Aq 2 = Aqi = || > i.e., the increased concentration benefits 
all remaining firms. Yet one obsolete firm has been shut down so that the aggregate 
change is AQ= (2n-l)^-q 2 = -^. 

The changes in the various elements constitutive of the welfare are 

ACS = (Q+!AQ)AQ=-(m7i + mfe-H)fji (15.8) 

Alii = n{2q 1 + Aqi)Aqi = n(2q 1 + Aqi)^ (15.9) 

An 2 = (n-l)(q 2 + Aq 2 f-nqj = [n-l) [2q 2 + Aq 2 )Aq 2 - q\ 

= ql{ in -l)(2 + ±)±.-l) = -(3n + l)(^) 2 (15.10) 

Since all variations in (15. 8), (15. 9), (15. 10) are proportional to *, 

—AW = -{nq 1 + nq 2 -^ i ) + n{2q 1 + £ i )-(3n + l)^ 

= nqi-{n + l + ± i )q 2 = n6-{l + ± i )q 2 (15.11) 

Let us interpret (15.11) using the rightmost expression. The first term is the positive 
welfare balance brought about by efficient firms whereas the negative second term is 
slightly more than the output of the obsolete mothballed firm. In equilibrium, an efficient 
firm produces more than an obsolete one; let us call "output gap" the difference q\-q 2 = 8. 
A merger between two asymmetric firms is welfare improving if the combined output gap 
of all efficient firms n8 is slightly larger than the output of the retired obsolete firm. 



15.3 Measures of Concentration 

The exercise of market power is primarily linked to the size of a firm so that indices 
of concentration within an industry are of crucial importance. The first task is thus to 



define precisely the market where concentration is measured. 

15.3.1 What is a Market? 

To answer this question we need to identify the firms that might sell products relatively 
similar for consumers, the later being themselves identified as an homogeneous group. 

Products 

A product or service can be perfectly described by its technological characteristics and 
the process of its production. Since a perfect substitute is impossible to find one could 
conclude that there is a unique producer in each market. Yet society does not implode 
when a product is missing because we always manage to find something similar called a 
substitute; now if any product has some, even imperfect, substitute then there is a single 
market in the whole economy! 

Those two schizophrenic views can be useful to develop the theory because they en- 
able to concentrate on other important strategic aspects. In the present section, however, 
we present the workable concept used by the EC when it needs to apply the theory to 
judge whether competition is being distorted in a market. The EC concept of relevant 
market identifies the boundaries of competition between firms on two accounts: 

• The product market comprises all those products which are regarded as substitutable 

by reason of characteristics, prices and intended use; products that could readily 
be put on the market by other producers without significant switching cost or by 
potential competitors at reasonable cost and within a limited time span also need 
to be taken into account. The legal view of the EC is interchangeability from the 
viewpoint of consumers or producers. 

• The geographic market comprises the area in which concerned firms are involved in 

the supply and demand of products in which the conditions of competition are suf- 
ficiently homogeneous and which can be distinguished from neighboring areas, be- 
cause the conditions of competition are appreciably different in those areas. 

The definition of the relevant market is the object of controversy as the following 
example makes clear. The chemical firm DuPont used to control some 75% of the US 
cellophane market in the 1950s. The US government interpreted the data as evidence 
of monopoly while a court agreed with the company that the relevant market was much 
larger because cellophane could be substituted by aluminium foil, wax paper or polyethy- 
lene. In the resulting "wrapping" market, DuPont's share was only a modest 20%. It is 



clear that a firm under scrutiny will always argue for a high degree of substitutability in 
order to mechanically decrease her market share. 

Historically Cournot (1838) and Marshall (1890) argued that a perfectly competitive 
market would be characterized by highly correlated prices i.e., the prices need not be the 
same in different locations (e.g., because of transportation cost), but they should change 
at the same time. Also, two close substitutes should have a stable relative price because 
arbitrage opportunities among consumers act as a centripetal force on prices. 

In practice, the EC uses own and cross-price elasticities of demand to check whether 
two products are close substitutes in which case they are said to belong to the same 
market. The methodology consists in applying a small but significant non-transitory in- 
crease in prices (SSNIP-test) for all product of the candidate market. If the joint profits 
of firms increase, then the candidate market can be treated as the relevant market be- 
cause the set of included products are sufficiently close substitutes among themselves 
and sufficiently distant substitutes for the other non-included products. 

Verboven (2002) applies this test to passenger cars in the EU using sales data be- 
tween 1970 and 1999 from the top 5 national markets covering 75% of European sales. 
For that period, the permanence of important international price differentials is proof of 
the existence of trade barriers, thus the geographic markets for car retailing should be 
accordingly defined as the national markets. The candidate markets are defined using 
common classifications from the specialized press. Table 15.11 displays the candidate 
markets and the econometric estimations with the own price elasticity e, the cross price 
elasticity inside the segment £,, the cross-price elasticity across segments r\ and An, the 
joint profits increase (for Germany) after a 10% joint increase of prices within one seg- 
ment. 



Segment 


Example 


e 


f 


n 


An 


Small 


Opel Corsa 


3.0 


0.01 


0.002 


16 


Compacts 


VWGolf 


2.4 


0.04 


0.002 


8.4 


Intermediate 


Peugeot 406 


2.9 


0.03 


0.002 


8.2 


Standard/luxury 


Audi A6 


5.9 


0.11 


0.001 


22 


Sports 


Mercedes SLK 


3.3 


0.02 


0.001 


5.4 


Minivans 


Renault Espace 


1.7 


0.10 


0.001 


0.1 



Table 15.11: Relevant Car Market in Europe 



The conclusion we can read from the table is that all but the minivan segments are 
relevant markets. For the failing one, a further segmentation according to size and price 
would be needed to distinguish the now large variety of monospace vehicles. 



Participants 

Since substitute products may not be already on the market, the EC distinguishes two 
kind of actors: 

• An actual competitor to a firm is another firm which is either currently active on the 

same relevant market or which is able to switch production to the relevant market 
in the short term without incurring significant additional costs or risks in response 
to a small and permanent increase in relative prices (immediate supply-side sub- 
stitutability). 

• A potential competitor is a firm that would be likely to undertake the necessary ad- 

ditional investments to enter the market in response to a small and permanent 
increase in prices. Market entry needs to take place sufficiently fast so that the 
threat of potential entry is a constraint on the market participants' behavior. The 
time period needed by companies already active on the market to adjust their ca- 
pacities can be used as a yardstick to determine this period. 

The EC then defines potential competition as the pressure exercised upon incumbent 
firms by the possibility that new or existing firms will enter their market. New entrants 
may be attracted by above normal profits made in this market by incumbent firms, pos- 
sibly as a result of weak competition. Additional firms entering the market will increase 
the overall quantity supplied with the effect that prices fall and above normal profits 
disappear. Hence, potential competition has a "disciplinary effect" on the behavior of 
incumbents. However, this threat is relatively small when entry barriers are high. 

Classifications 

Products classifications are also useful to define markets as they sort out the technolog- 
ical characteristics of products and services. 

The International Standard Industrial Classification of all Economic Activities (ISIC 
Rev. 3.1) of the United Nations has 60 divisions belonging to the 17 categories presented 
in Table 15.12. 

Two widely used classification systems are: 

• North American Industry Classification System (NAICS) developed by the member 

countries of the NAFTA agreement (USA, Canada and Mexico). 

• General Industrial Classification of Economic Activities (NACE) (acronym derived 

from the French full name) developed by the member countries of the European 
Community. 



A Agriculture, hunting and forestry 
B Fishing 

C Mining and quarrying 
D Manufacturing 
E Electricity, gas and water supply 
F Construction 

G Wholesale and retail trade, household goods 
H Hotels and restaurants 
I Transport, storage and communications 
J Financial intermediation 
K Real estate, renting and business activities 
L Public administration, defense, social security 
M Education 
N Health and social work 

O Other community, social and personal services 
P Private households with employed persons 
Q Extra-territorial organizations and bodies 



Table 15.12: ISIC Activities Classifications 

Both systems use the ISIC categories and divisions but differ in finer levels because 
NAICS is more focused on the production process. A convergence project between NACE 
and NAICS is currently under development. A wealth of information on classifications 
can be found at Eurostat's Server RAMON. 

Product diversity 

Assessing the diversity of available products in a given market is not straightforward. 
Intuitively, one would use the number of products. Applying this to the market for com- 
puter operating systems we have to count all softwares available in commercial, share- 
ware or freeware form; we end-up with a large figure but also with the feeling that the 
right one would be 3 since the most popular ones are Microsoft's Windows, Linux and 
Apple's Mac OS X. In a sense, their popularity is a demonstration of their quality and 
therefore an indicator of the real range of products available for a new user. At the limit, 
the indicator should not to jump on the day a new product is launched. Lastly, if we 
divide this market into personal and business computing, we would like total diversity 
to be the sum of product of diversity among the two classes and within them. 

To build an index Y of product diversity satisfying these properties the information 
at our disposal is the current number of products n and their market shares, hence Y 



is a function of the vector of market shares s = (Sj)j<«. The first property states that 
counting products is correct if the market is evenly shared: if Vz < n, Si = \ then Y(s) = n. 
The second property is akin to continuity of Y (small changes induce small changes). To 
enunciate the third stability property, we need to decompose the long list of n products 
into a smaller number of classes ;' = 1 to m, each one containing kj products. The class 
shares are then V; < m,aj = HiejSi while shares within classes are sj = st/aj. The class- 
stability property expressed in multiplicative form then reads Y(s) = Y(<r) x Ylj< m Y(.sJ) a J. 
As shown by Theil (1967), these axioms uniquely define our product diversity index as 
the entropy measure Y(s) = -Y\i< n {Si) Si introduced by Shannon (1948) in his theory of 
information. 15 ® 

15.3.2 Concentration Indices 

Concentration arises either where two or more previously independent firms merge (merger) 
where a firm acquires control of another (acquisition of control), or where a joint venture 
is created, performing on a lasting basis all the functions of an autonomous economic 
entity (full-function joint venture). 

Assuming that the relevant market has been defined, the active firms are identified 
and their sales over a meaningful period of study, say a year, are recorded. We can then 
rank firms according to their market shares, a number in [0; 1], from the largest S\ to the 
smallest s n . The concentration index sums the market shares of the k largest firms, 

it 
Q = I St 

i=l 

while the index proposed by Herfindahl (1950) and Hirschmann (1964) sums all squares, 

H=ts 2 t (15.12) 

Index C4 was first used in 1968 by the US merger guidelines and later replaced in 
1982 by the Herfindhal-Hirschmann index (HHI) which is H where shares are expressed 
in percentage; it ranges from 10.000 in the case of a pure monopoly to a small number 
in the case of an atomistic market (nearby perfect competition). The EC, being a junior 
antitrust body, directly adopted the latter index. For practical purposes the spectrum 
of market concentration is divided into non concentrated for HHI < 1000, highly concen- 
trated if HHI > 2000 (1800 in the US) and moderately concentrated otherwise. To get an 
idea of the meaning for these thresholds, consider n firms sharing evenly a market, we 
have HHI = 1000/ n i.e., HHI = 1000 for n = 10 and HHI = 2000 for n = 5. 



As first uncovered by Gibrat (1931) and developed §15.3.4, the distribution of firm 
sizes is highly skewed in the sense that a few large players make up most of the output 
(or whatever measure of sizes we may consider). This implies the existence of a compet- 
itive fringe whose individual contribution to HHI is almost nil (due to the compressing 
effect of the squaring). We can thus treat the firms below 1% as having a zero contribu- 
tion and fasten the analysis by focussing on the ten to twenty largest firms in the market. 
Applying Gibrat's Law stating that firm growth rates are independent of size, we obtain 
an asymmetrical distribution 16@ for which we can compute in Table 15.13 concentration 
as a function of the number of active firms. 



n 


3 


4 


5 


6 


8 


10 


15 


20 


30 


HHI 


4894 


3790 


3155 


2740 


2227 


1917 


1494 


1271 


1031 



Table 15.13: HHI under the Gibrat's Law 



In case of a merger, one considers both the post-merger market concentration and the 
increase in concentration resulting from the merger. The EC rule is slacker than its US 
counterpart and amounts more or less to accept any merger in any of the following cases: 

• post-merger HHI below 1000 

• post-merger HHI between 1000 and 2000 (1800 in the US) but with an increase less 

than 250 (100 in the US) 

• post-merger HHI greater than 2000 (1800 in the US) but with an increase less than 

150 (50 in the US) 

Another popular measure of market power, especially among economists, is the Lerner 



(1934) index (cf eq. (3.4)), U 



Pi-Cn 



L that divides the margin by the price; it is zero in 



a competitive market and nearby one for a monopoly facing an inelastic demand. The 
Lerner index for the market is then L = £" =1 s ; L ; which, in a first approximation, is equal 
to H divided by the elasticity e of the market demand as shown by Dansby and Willig 
(1979).. Computing the Lerner index at the Cournot equilibrium, one finds L ; = — where 
is the market share and e the elasticity of consumer demand. One then checks 



that Zj SjLj = I Zj s] = Hie. 

On analyzing a potential M&A operation between industry members i and j, the 
current HHI is often compared to the potential HHI that would result if all market 
shares remained the same because the computation involved is trivial. The mechanical 
increase of HHI is (s; + s,) 2 - (s 2 + s 2 ) = 2s;s ; -. Yet, if no firm changes its output, the price 

J % J J 

does not change either so that welfare increases as soon as there is an economy of scale 
for the merged entity (i.e., the sum of insiders profits increase). It would therefore appear 
that HHI is not a good indicator of market power. This apparent contradiction is in fact 



a reminder that the post merger HHI must be computed at a market equilibrium just 
like the pre-merger HHI in order that we can meaningfully compare these two figures. 

15.3.3 Industry Cases 

We conclude this chapter with several numerical illustrations. §9.1.2 presents the con- 
centration figures for the famous OPEC oil cartel. 

Movies 

Data kindly provided by Bruce Nash allows to look at the market shares of the major 
film studios as reported in Table 15.14. While Disney is a clear leader, there are five 
other major studio that maintain concentration at a low level. 



Distributor 


1995 


2000 


2005 


2009 


Mean 


Walt Disney 


22.5 


21.3 


14.4 


11.8 


18.0 


Warner Bros. 


16.4 


11.4 


16.0 


20.0 


14.4 


Sony Pictures 


13.3 


9.7 


10.9 


14.1 


13.5 


20th Century Fox 


8.0 


10.0 


16.4 


16.0 


12.5 


Paramount Pictures 


10.0 


10.7 


9.6 


14.3 


11.7 


Universal 


12.6 


15.8 


12.4 


9.9 


11.5 


New Line 


6.2 


5.0 


4.7 


0.0 


5.0 


Dreamworks SKG 


0.0 


10.3 


5.6 


0.0 


3.5 


MGM 


6.3 


1.4 


2.0 


0.2 


2.9 


Lionsgate 


0.0 


1.4 


3.2 


3.8 


2.3 


Weinstein Co. 


0.0 


0.0 


1.1 


1.6 


0.4 


Summit Entertainment 


0.0 


0.0 


0.0 


4.5 


0.6 


Newmarket Films 


0.0 


0.0 


0.1 


0.0 


0.4 


IMAX Films 


1.2 


1.2 


0.4 


0.2 


0.4 


Gramercy 


1.1 


0.0 


0.0 


0.0 


0.2 


Revenue ($bn) 


5.3 


7.5 


8.9 


10.6 


8.2 


HHI 


1356 


1278 


1166 


1334 


1191 



Table 15.14: Movie Distribution Market Shares 



The situation is similar in France as shown on Table 15.15 but with changing posi- 
tions as alliances are made and broken. 



Distributor / 2000 


% 


Gaumont Disney 


19.7 


UIP 


12.5 


Bac Films 


10.7 


Pathe 


9.2 


UFD 


9.2 


Columbia 


7.3 


ARP 


7.0 


Warner Bros 


6.8 


Metropolitan 


2.7 


Pyramide 


2.0 


Total top ten 


87 


HHI 


1003 



Distributor / '2004 


% 


Warner Bros 


15.0 


UIP 


11.3 


Pathe 


11.1 


Disney 


10.8 


Gaumont Columbia 


10.4 


Mars Distribution 


9.4 


UFD 


7.2 


Metropolitan 


4.1 


TFM Distribution 


3.9 


EuropaCorp 


2.7 


Total top ten 


86 


HHI 


898 



Distributor / 2009 


% 


20th Century Fox 


12.2 


Warner Bros 


9.4 


Pathe 


7.9 


Sony 


7.7 


Disney 


7.5 


SND 


6.3 


TF1 /UGC 


5.7 


Mars Distribution 


5.0 


StudioCanal 


4.9 


Metropolitan 


4.9 


Total top ten 


72 


HHI 


642 



Table 15.15: Movie Distribution Market Shares 

Computer 

During the 1990s, Microsoft's products have gained a very large market share by out- 
performing competitors. For operating systems, the family of windows has gained an 
extremely dominant position that has only begun to shrink in the latest years; it is the 
ubiquitous example of monopoly. In the market for web browsers, the various versions 
of IE have proven less resistant to the oncoming of a variety of challengers. Lastly, the 
newest market of web search has allowed Google to become a new mammoth of the in- 
ternet. 

Table 15.16 uses recent data collected by NetApplications to illustrate this point but 
also the recent threats posed by aggressive new comers. Using these data, Figure 15.4 
displays the evolution of the monthly HHI for the three digital markets. 




2005 



2006 



2007 



2008 



2009 



2010 



Figure 15.4: HHI in the digital markets 



European Industries 



In the food retail (supermarkets) sector, the EC has computed the EU wide concentration 
index C 5 for 1996; it obtains a low 15% which indicates an absence of concentration. 



OS 


Windows 


Mac 


Other 








HHI 


2004 


90.3% 


3.2% 


6.5% 








8198 


2005 


92.6% 


3.9% 


3.5% 








8602 


2006 


92.5% 


5.2% 


2.3% 








8580 


2007 


91.6% 


6.2% 


2.2% 








8430 


Browser 


IE 


Firefox 


Safari 


Netscape 


Opera 


Other 


HHI 


2004 


92.3% 


2.7% 


1.5% 


2.1% 


0.5% 


0.9% 


8536 


2005 


86.5% 


8.6% 


2.6% 


1.3% 


0.5% 


0.5% 


7568 


2006 


81.3% 


13.0% 


3.9% 


0.8% 


0.6% 


0.4% 


6791 


2007 


79.8% 


13.7% 


4.7% 


0.8% 


0.7% 


0.4% 


6570 


Search 


Google 


Yahoo! 


MSN 


Other 






HHI 


2004 


50.2% 


15.0% 


14.6% 


20.2% 






3040 


2005 


57.5% 


12.5% 


9.6% 


20.4% 






3642 


2006 


65.2% 


11.6% 


5.4% 


17.9% 






4475 


2007 


64.8% 


11.3% 


4.8% 


19.1% 






4420 



Table 15.16: Market Shares and HHI in the Software Industry 



Nevertheless Europe was still the conjunction of 15 national markets at the time, not a 
unified single market; this is reflected by the fact that the average of national C5 indexes 
is a much larger 44% because many member states have powerful national retail chains. 
Matraves (1999, 2002) in her works on European integration and market structure offers 
interesting data gathered in Table 15.17. 



c 4 


1970 


1975 


1980 


1987 


1993 




c 4 


1985-7 


1991-3 


DE 


25.4 


34.0 


32.2 


20.8 


22.0 


DE 


26 


28 


UK 


22.1 


26.2 


29.0 


34.9 


63.2 


UK 


34 


35 


IT 


25.9 


30.3 


41.1 


27.0 


23.6 


IT 


17 


15 


FR 


66.9 


70.7 


76.2 


63.2 


63.4 


FR 


11 


11 


EU 


- 


- 


- 


26.4 


28.7 


EU 


19 


16 



Soft Drinks 



Pharmacy 
Table 15.17: Concentration in some European Industries 



Automobile 

Table 15.18 illustrates the European market for cars. 17@ We observe almost no concen- 
tration at the EU level but one should bear in mind that the sector benefited from a block 
exemption that fragmented the single market in rather isolated national markets. A 
recent EC directive has revised the exemption; it still permits restrictive vertical agree- 
ments between manufacturers and their dealers but only if the market shares held by 
the companies concerned do not surpass the 30% and 40% limits. Furthermore it elimi- 
nates from exemption the location clauses whereby the dealer is assigned a specific main 



location and is prohibited from operating additional sales or delivery outlets at other lo- 
cations. The aim is to facilitate multi-brand outlets (i.e., foster interbrand competition) 
and reduce territorial restrictions (i.e., foster intrabrand competition among dealers). 



Firm / Year 


1980 


1985 


1990 


1995 


1999 


Fiat 


18.7% 


15.7% 


13.8% 


11.1% 


9.5% 


Volkswagen 


15.6% 


16.4% 


15.7% 


16.8% 


18.8% 


Peugeot-Citroen 


17.5% 


12.9% 


12.7% 


12.0% 


12.1% 


Ford 


14.8% 


16.3% 


11.5% 


13.7% 


11.7% 


Renault 


16.6% 


12.0% 


9.7% 


10.3% 


11.0% 


Opel (GM) 


11.1% 


13.3% 


12.0% 


13.1% 


11.5% 


Japan 


10.9% 


12.1% 








Daimler-Chrysler 


4.3% 


4.2% 


3.2% 


4.0% 


5.6% 


Nissan 


3.8% 


2.7% 


2.9% 


3.0% 


2.6% 


B.M.W. 


0.0% 


3.1% 


5.6% 


6.4% 


3.2% 


Toyota 






2.7% 


2.5% 


3.2% 


Mazda 






2.1% 


1.4% 


1.4% 


Other 






2.1% 


0.5% 


0.3% 


Rover 


0.0% 


4.3% 


0.0% 


0.0% 


1.5% 


Mitsubishi 






1.3% 


1.1% 


1.2% 


Honda 






1.2% 


1.5% 


1.4% 


Korea 


0.0% 


0.1% 


0.1% 


1.5% 


3.2% 


Sales (Million) 


7.4 


9.5 


13.5 


12.0 


15.1 


HHI 


1580 


1338 


1042 


1092 


1059 


C 4 


67% 


61% 


54% 


54% 


52% 



Firm / Year 


2000 


2003 


2006 


2009 


Volkswagen 


18.7% 


18.2% 


19.8% 


20.9% 


Peugeot-Citroen 


13.1% 


14.8% 


13.2% 


13.3% 


Ford 


10.8% 


11.0% 


10.7% 


10.4% 


Opel (GM) 


10.8% 


9.8% 


10.2% 


8.9% 


Renault 


10.6% 


10.6% 


8.6% 


9.1% 


Fiat 


10.0% 


7.4% 


7.6% 


8.8% 


Daimler-Chrysler 


6.2% 


6.5% 


6.2% 


4.9% 


Toyota 


3.7% 


4.8% 


5.8% 


5.2% 


BMW 


3.4% 


4.4% 


5.3% 


5.1% 


Nissan 


2.7% 


2.8% 


2.1% 


2.6% 


Hyundai 


1.5% 


1.7% 


2.0% 


2.2% 


Honda 


1.2% 


1.4% 


1.7% 


1.6% 


Mazda 


1.2% 


1.5% 


1.7% 


1.5% 


Suzuki 


0.9% 


1.0% 


1.5% 


1.6% 


Kia 


0.5% 


0.8% 


1.5% 


1.6% 


Mitsubishi 


1.1% 


0.8% 


0.8% 


0.6% 


Others 


0.3% 


0.3% 


0.6% 


1.1% 


Sales (Million) 


14.7 


14.2 


14.8 


13.7 


HHI 


1048 


1038 


1035 


1060 


C 4 


53% 


54% 


54% 


53% 



Table 15.18: European Automobile Market (EU-15) 



Regarding the US vehicle market, we use WardsAuto's data on Cars and Trucks to 
construct Table 15.19. We present the market share of the 16 largest sellers of the last 
decade ordered by their average. The continuous substitution of US historical big-three 
towards the Japanese big-three induces a reduction of concentration. As observed on 
Figure 15.5, the HHI concentration index has steadily decreased over the last 40 years 
and converged to the low level achieved in the EU two decades ago. 



Firm/Year 


2000 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


2008 


2009 


GM 


28.0 


28.0 


28.3 


27.7 


26.9 


25.6 


23.9 


23.2 


21.9 


19.6 


Ford 


22.6 


21.6 


19.9 


19.2 


18.0 


17.0 


16.0 


14.6 


14.2 


15.3 


Toyota 


9.1 


10.0 


10.3 


11.0 


11.9 


13.0 


15.0 


16.0 


16.5 


16.7 


Chrysler 


14.2 


13.0 


12.9 


12.5 


12.8 


13.2 


12.6 


12.6 


10.8 


8.8 


Honda 


6.5 


6.9 


7.3 


8.0 


8.1 


8.4 


8.9 


9.4 


10.6 


10.9 


Nissan 


4.2 


4.0 


4.3 


4.7 


5.7 


6.2 


6.0 


6.5 


7.1 


7.3 


Hyundai 


1.4 


2.0 


2.2 


2.4 


2.4 


2.6 


2.7 


2.8 


3.0 


4.1 


Volkswagen 


2.5 


2.5 


2.5 


2.3 


1.9 


1.8 


1.9 


2.0 


2.3 


2.8 


Daimler 


1.9 


1.8 


1.8 


1.9 


2.0 


2.2 


2.4 


2.1 


2.4 


2.4 


BMW 


1.1 


1.2 


1.5 


1.6 


1.7 


1.8 


1.8 


2.0 


2.3 


2.3 


Kia Motors 


0.9 


1.3 


1.4 


1.4 


1.6 


1.6 


1.7 


1.9 


2.0 


2.8 


Mazda 


1.4 


1.5 


1.5 


1.5 


1.5 


1.5 


1.6 


1.8 


2.0 


2.0 


Subaru 


1.0 


1.1 


1.1 


1.1 


1.1 


1.1 


1.2 


1.1 


1.4 


2.0 


Mitsubishi 


1.8 


1.9 


2.0 


1.5 


0.9 


0.7 


0.7 


0.8 


0.7 


0.5 


Volvo 


0.7 


0.7 


0.7 


0.8 


0.8 


0.7 


0.7 


0.7 


0.5 


0.6 


HHI 


1610 


1565 


1524 


1475 


1421 


1353 


1329 


1268 


1198 


1440 



Table 15.19: US Vehicle Market Shares 




2000 



2010 



Figure 15.5: HHI evolution in the US & EU car markets 



The degree of penetration of foreign cars in the EU is approximatively 45%, mostly 
Japanese and Korean brands. The corresponding figure for the US is 55% as European 
car makers have a non negligible share. On the contrary, the degree of openness of 
Japan and Korea is extremely low around 5% which means that the local brands have a 
greater market power (3 in Korean, 7 in Japan). One should not however conclude that 
Europeans show less national preference. Indeed, as shown on Table 15.20, member 
states of the EU show a marked preference for local car makers although with a twist. 

Germany has 3 truly national firms but is also the place in Europe where 2 historical 
US firms develop cars; in the end, the national share is beyond two thirds. As a con- 
sequence concentration is about 50% higher than the European figure; there are many 
equally sized firms competing for the German market but the VW is really a dominant 
player. The French case fits the US or asian countries with a strong national preference 
and an extremely high concentration back in the 80s that slipped due to the decline of 
Renault and the progressive penetration of all foreign brands. On appearance, Spain is 
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Table 15.20: Vehicle Market Shares in european Countries 

different because its sole national brand is now foreign owned; yet, the country is home 
to assembly plants from most European car makers so that we may compute a quasi 
national share at about 70%; this diversity is reflected into a concentration level only 
slightly higher that the EU value. Italy, lastly, hosts a unique large automobile group 
that dominates the market; yet concentration reaches a level similar to the German one 
as other automakers have shares around 10%. 

Airlines 



Table 15.21, using US Federal data, reports on the US market for domestic and interna- 
tional flights as well as revenue (for US carriers only). Tables are sorted by decreasing 
rank in 2009. Figure 15.6 reports the evolution of HHI over the last tow decades for 
domestic and international passenger volumes. 
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Int. Mkt. 
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2000 
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Dom. Revenue 
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Table 15.21: Airlines Market Shares in the US 
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Figure 15.6: Concentration in the US airline market 



15.3.4 Size Distribution 

Cities (within a country), firms (within an industry), wealth (within a population) display 
highly unequal sizes, mostly because some have grown faster. As originally shown by 
Pareto (1896) for wealth distribution, Auerbach (1913) for cities population and Gibrat 
(1931) for firm size, 80% of the quantity being measured comes from 20% of the producing 
units aka. the 80-20 rule. Such an empirical regularity has been observed for a large 
number of social activities but also in nature. For instance, Gaffeo et al. (2003) show 
that the distribution of firms' size in the most industrialized countries follow closely 
Zipf's rank rule presented hereafter. 

It is important to explain why the statistical distribution of firm sizes does not obey 
the predicaments of theory where firms end up being symmetrical in equilibrium. The 
explanation given by Gibrat (1931) is that each family in a city, each employee in a firm 
or each euro of capital wealth are equally able to generate an offspring, come up within 
an innovation or being invested in a superior project. As shown in the theory section 
below, such a proportional (or size independent) growth process generates a power law 



statistical distribution (after enough periods) fitting adequately the empirical one. First, 
we offer an informal presentation based on popular items of everyday life. 



Intuition: Zipf's law 

Classify all your emails according to who is the sender or receiver and sort them into 
separate boxes for each of your m contacts. Then order the boxes from the biggest to the 
smallest as in Table 15.22. 



Names 


Ali 


Bess 


Chris 


Doug 


Ellen 




#i 


Total 


Count 


n a 


n b 


n c 


na 


n e 




rik 


N 


Frequency 


f _ n a 
la- N 


ft 


fc 


fd 


fe 




fk 


1 


Set r^ f a 
and compute 


fJr 


Mr 


fdr 


fair 


feir 




Mr 




only to find 


1 


= 1/2 


= 1/3 


= 1/4 


= 1/5 




- \lm 





Table 15.22: Email contacts 

It is then possible to show that Z^j - 0.577 + ln(m) with convergence for m large. 
Manipulating the equation 1 = Y.f =l fi, one obtains y - 577 li n{m ) i- e -> that in most human 
collections like books or friends or words in a language, the most frequent item has a 
frequency as given in the following table: 



Collection size (m) 


10 


100 


1000 


10 4 


10 5 


10 6 


10 7 


10" 
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Frequency (%) 


35 


19 


13 


10 


8 


7 


6 


5.3 


4.7 



Table 15.23: Frequency of preferred item 

Now, the utility we derive from exchanging emails with all m contacts can be esti- 
mated by the total number of emails 

m m -i m -i 

u(m) = N = NY J fi = NyY J - = n a Y.-^ n a (0-577 + ln(m)) « log(m) 

i=l i=l l i=l l 

the base 10 logarithm since we can change the monetary unit (divide by j^jj) and 
then subtract a suitable constant without altering the representation of preferences (cf 
§2.2.1). Hence, the utility of having 10 email contacts is log(10) = 1 while that of 100 
contacts is log(100) = 2. In other words, my "top-ten" contacts are as valuable as the re- 
maining 90 entries. This decomposition of total utility into increments can be applied to 
other fields using the property log(10 m ) = m: 

• Books collection of size 1000: the 10 books I read most often are as worth to me as 
the next 90 I consult from time to time which are as valuable as the remaining 900 
which I barely remember reading. 



• Internet Music Store of size one million songs: the collection may be divided into 3 
clusters of equal value to the owner (say yourself or a store): the top 100 of "hits", 
the next 9900 of "classics" and the remaining 990000 "never-heard-of" songs. 

• US Phone Network: if the customer base is approximatively 100 millions (10 8 ) then 
the 10000 biggest clients (10 4 ) generate half of the network value. 

This clustering shows that digital databases have a very Long Tail that can still be 
profitably marketed thanks to the nearly zero cost of accessing an item when there is a 
request for it. 

Theory 

An economic variable X follows a power law if the complementary cumulative distribu- 
tion function 18@ has the form Pr(X > x) = bx~ a for some positive parameters a and b. The 
property is said to apply to the (upper) tail when it holds true for large x only. To see 
whether some data follow the power law, take a set of ordered observations x\ > .. > x n , 
estimate Pr(X > x0 by gi = ^ and plot (huc,',lngj)i<n to find a straight line with slope -a. 

Gibrat (1931)'s hypothesis of a proportional growth process is Z t = -^ — - = n + ae t 
where e t is a white noise. 19@ The idea is that each family in a city, each employee in 
a firm or each euro of capital wealth are equally able to generate an offspring. If this 
is correct, we can use the fact that Z t = lnX f+ i -lnX f for small periods, to sum from a 
initial period up to a final time T and get In Xj - In X + /u T + a £ t < T e t . By the central limit 
theorem, the mean ^Et<r £ f tends to a normal variable when T grow large, so that ln^ 1 
tends to a normal variable with mean fxT and variance a 2 T (i.e., the distribution of X T is 
lognormal). 

Gibrat (1931)'s model fits well empirical data except for the upper tail that is system- 
atically fatter or longer than predicted in the sense that more large units are observed 
than predicted. It is then necessary to take into account the entry and exit of units from 
the current population. Regarding firms, we may assume that, at each period, a new 
market opportunity or innovation arises and can be taken up by an existing firm or by 
an entrepreneur that will create a new firm to exploit it. When we look today at the 
current population, it consists of units born at different times in the past so that we 
do not have a sample of realizations of the random variable X T for an exogenous T but 
realizations of Xf where f is the endogenous age of the firm. Reed (2001) shows that if 
the instantaneous probability of entry A is constant over time (exponential process) then 



Y = In -jr- follows a power law in both the upper and lower tails i.e., 

| Pr{Y > y) = ^e~ a y ify>0 ^ j Pr(X t > x) = J^x~ a ifx>x 
\ Pt{Y < y) = -^pePy ify<0^{Pr{X t <x) = ^xP ifx<x 

where a and -/3 the two roots of the characteristic quadratic equation \a 2 t{t- 1) + [it = X 
and using y = ln(^). 

Xq 



PartG 



Public Oversight 



Part D has studied the interactions of firms with competition law taking the latter as 
exogenously given. In this part, we inquire about the motivations for public authorities 
to step up into the economic life. We take a shortcut by referring to the State as the 
superstructure gathering all the organizations in charge of framing the economic activ- 
ity; it includes the legislative, judicial and executive bodies and within the latter, the 
government, its bureaucracy and regulatory agencies. 

The first chapter recalls what is the State, why it exits, how it came to weight so much 
over economics and how it receded a little in recent decades. The next chapter takes an 
advanced look at regulation, the direct State intromission in an industry. We end this 
part with a chapter on natural resources because this sector displays natural monopoly 
features and externalities requiring public oversight. 



Chapter 16 



The State 



In microeconomics, the State is foremost associated with the design and enforcement of 
the legal framework under which economic agents interact. Yet it is also the main eco- 
nomic actor being the largest employer (civil servants, health and education personnel), 
the largest purchaser (procurement), the largest investor (infrastructures) and, last but 
not least, a tight controller of most markets through price caps, quotas and regulations. 
Firms thus make business with the State while being overseen by him. This chapter will 
try to make sense of these seemingly unrelated activities, keeping an emphasis on the 
implications for firms. 

We first present the missions that the State ought to perform and the leading role it is 
driven to assume in the economy. We then rationalize this judge-and-party duality using 
the economic concepts of market failure and its opposite, the government failure. We also 
offer a power struggle view of the formation of the State called rent -planning. It helps 
to make sense of the ubiquitous rent-seeking phenomenon induced by the hierarchical 
State-firm relationship, which is the object of the next section. Lastly, we review the 
recent changes in the structure of the State with the liberalization wave. 

16.1 Missions and Means 

16.1.1 Missions 

The French Republic's motto of liberty, equality and fraternity summarizes neatly the 
objectives of advanced democracies: rule of law, provision of public services and wealth 
redistribution. We present them in turn and conclude with a few statistics regarding the 
weight of the State in the economy. 1@ 



Rule of Law 

The pursuit of happiness, to which we all aspire, requires living in a peaceful society 
and enjoying effective political and economical freedoms. The accepted definition of We- 
ber (1922) defines a State as "a human community that (successfully) claims the mono- 
poly on the legitimate use of physical force within a given territory". Its sovereign mis- 
sions, defining a minimal State, are diplomacy, defense, justice and police. We can also 
add macro-economic stabilization since nation-wide crisis have the potential to generate 
chaos and anarchy. 

The general aim of justice is to protect individuals from abuse and exploitation. The 
State sets limits to our freedom in some dimensions to guarantee us minimal freedoms 
in all dimensions; it is then the enforcer of "human rights". The transposition of "free- 
dom as effective rights" from citizens to economic agents (firms, entrepreneurs) revolves 
around "property rights" which are fully developed in §8.1. The core mission of the State 
is thus to establish the rule of law i.e., that everyone, including the State itself (and its 
representatives), is to obey the laws enacted by the community (through its representa- 
tion system). The hurdles faced in this endeavor are analyzed later in §16.2.4. 

Public Services 

Equality as a guiding principle of democracies is nowadays understood as equality of 
opportunities, not of results such as income. The State has then a mandate to "level 
the playing field" for individuals, in the same manner that competition law does for 
firms (cf §8.2.4). To implement this equity principle, society makes value judgments 
and defines the public services as those which citizens "should" consume. This modern 
vision justifies in hindsight the avowed paternalism of economists (and more generally 
intellectuals) that existed before the advent of democracy and widespread education. 2 ® 
An economic, but partial, rationalization is soft paternalism: as we suffer from bounded 
rationality, cognitive problems or lack of information, we make mistake so that there is 
a corrective role for the State (cf. also Behavioral Economics). 

For out purpose in this book, public services are conveniently divided into essential 
and network ones. The former are education, health, sport, culture, housing and social 
services. 3 ® The State subsidizes some of them but more often than not, organizes free 
universal provision (in-kind transfers) or even mandates compulsory consumption. The 
fact that the State is also a producer of these services has political roots but is also 
an optimal economizing response in the face of a lasting market failure: information 
asymmetry (cf. §13.1.3). 

The network services are transportation (road, train, plane), communication (mail, 



phone, internet) and households services (water, sewage, energy). Although society does 
not treat these services as "essential", they are seen as "basic". For that reason, the 
State oversees their development and funds their consumption, both on the supply and 
demand sides. At the same time, the industrial sectors supplying network services are 
natural monopolies involving high sunk costs, positive network externalities and a po- 
tential abuse of monopoly power. As we explain in detail in §17.1.1, these market failures 
warrant State oversight in order to guarantee maximal efficiency (allocative and produc- 
tive). 

Public services are provided either directly by state owned enterprises (SOE), indi- 
rectly by delegation to private firms or by a mix of the previous options. The specialized 
arm of the State supervising a private provider is called the regulator (so that SOEs are 
often referred to as self-regulated). Its tasks and the analysis of the strategic interaction 
with natural monopolies is the object of chapter 17 on regulation. 

Welfare State 

Fraternity (brotherhood) provided by the traditional circles of family, business or parish 
has faded with the anonymity of urban life; this social safety net has been replaced 
in affluent countries by a collective system known as the welfare state. It is foremost 
an expression of fairness that complements the equality of opportunities afforded by 
public services. According to Esping- Andersen and Myles (2008), the main component 
is social security which provides insurance in areas of pensions, health, family, labour 
or minimum wage. Although the very financing of economic security involves in itself 
some wealth redistribution, direct schemes (e.g., income or property tax) form a second 
component of the welfare state aimed at reducing income disparity, vertically towards 
the less affluent or horizontally towards deserving social groups. 4@ 

Economic Perspective 

We review fundamental, essential and basic services as well as the welfare state from a 
stricter economic point of view. 

Economically speaking, the fundamental services rendered by the minimal State are 
public goods i.e., non marketable and consumed in the same amount by all. 5@ As such 
they suffer from free riding, namely that voluntary contribution would raise too little 
in order to sustain a fully functioning market economy. Thanks to its monopoly on co- 
ercion, the State is able to finance them by taxing the general economic activity (firms, 
households, transactions,...). The downsides are, the violation of consumer sovereignty 
as we are made to pay more than we wish, a sacrifice of valuable scarce resources and an 



inefficient distortion of incentives (cf marginal cost of public funds in §17.1). Those defi- 
ciencies are nevertheless acceptable because fundamental services implement the rule of 
law i.e., reduce the numerous market failures arising from imperfect property rights and 
fraud in the entire economy (cf. §8.1). Given that the rule of law is firmly established in 
advanced economies, we are warranted to analyze an increase of fundamental services 
in terms of cost vs. benefit. 6@ 

Essential services and the welfare state mission involve a much stronger moral stance 
because they are financed by taxation, provided in-kind (instead of cash transfer) and of- 
ten have a mandatory character. As recalled by Currie and Gahvari (2008), the related 
markets display some market failures but the overriding explanation of the State's in- 
volvement is political will (or paternalism). Now, since taxation, the main financing 
vehicle, is distortionary and generates welfare losses, we face the dilemma of choosing 
between efficiency and equity. Lindert (2007) however shows, using recent OECD evi- 
dence, that the trade-off need not be so stark: there is no growth loss from spending more 
on the welfare state. 7@ In a sense, the welfare state is concomitant to growth, neither 
inimical nor friendly. 

Lastly, network services are not essential but only basic, they are marketed and not 
given away. Providers, whether public or private, must then cover their cost out of their 
receipts. At this point, the equity concern of society means that market segments with 
low willingness to pay and/or hight cost of supply are to be subsidized to foster access 
to the service (and its consumption). For sure, there is still a value judgement involved 
in the rates distortion but in a much weaker degree than for essential services since 
everyone still carry part of the underlying cost. We develop this issue in §17.3.4 on 
public service obligation. 

16.1.2 Means 

To carry on its costly missions, the State uses its exclusive dominion of coercion to tax 
economic units, operate monopoly over some profitable activities and seize land. Laws, 
decrees and regulations compel citizens and firms to behave adequately under the threat 
of retaliation with fines, confiscation or imprisonment. In democracies, expropriation for 
a right-of-way (eminent domain in US parlance) is conditional on a just compensation 
(cf. art. 17 of the French Human Rights declaration or the US Fifth Amendment. 

In advanced economies, the weight of the State is by no means small since more 
than a third of all wealth creation goes through the State who thus becomes the largest 
employer and the biggest spender in the nation (cf. procurement in §13.3.3). Historically, 
Tanzi and Schuknecht (2000) report that general government expenditure (including 



central, state and local governments' expenditures on goods and services, debt service 
and transfers to economic units) in OECD countries grew as shown in Table 16.1. 8@ 



1870 


1913 


1920 


1937 


1960 


1980 


1990 


1996 


11 


13 


20 


24 


28 


42 


43 


45 



Table 16.1: OECD Government Expenditure in % of GDP 

For the EU15 (resp. EU27), general government revenue 9@ averages 45.5% (resp. 
44.4%) of GDP over the last 10 (resp. 5) years with a 58% maximum for Sweden. The 
US and Japan equivalent figures are 33% and 31%. Most revenues come from taxes, the 
rest being sales of public services. Currently, government tax revenue, including social 
security, is about 41% of GDP in Europe (EU27) ranging from 50% in Sweden to 29% in 
Romania. It comes in equal parts from social contributions, direct taxes (income and 
wealth) and indirect taxes (products and imports). 

Table 16.2 displays, for some OECD countries, the distribution of expenditure accord- 
ing to the COFOG methodology with an aggregation fitting our three missions typol- 
ogy. 10@ It is noticeable that the weight of all three missions decrease over the decade (cf 
§16.4 on liberalization) except for the converging Korean economy. To assess how costly 
is the core mission of upholding economic and political rights, we take off defense and 
debt service (interest) and come-up with a cost between 6 and 8% of GDP 11@ 

Regarding civil service, Table 16.2 also display public employment cost as a share of 
GDP (cf. labour column). Comparing with the labour force, the highest rates of public 
employment in 2001 are Denmark (23%), Finland (21%) and France (18%) (cf. OECD 
(2005)). Some Eastern Europe countries had high levels but have reduced them since 
1990 to sanitize their public finances. Although the Anglo-Saxon countries profess a 
quite different model of the State, the figures for Australia, Canada, the UK and the US 
range from 16% to 14%. 

The conclusion that emerges from this statistical section is that the State is a gigan- 
tic actor in all advanced economies, whatever their avowed political stance may be. It 
controls and manages between one third and one half of the GDP. 



16.2 Rationalizing the State 

Having presented what the State does, we now seek to rationalize this outcome. We thus 
expose the standard theories regarding the role of State in the economy and conclude 
with a view more focused on the power motives of leaders. 
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Table 16.2: State expenditure in % of GDP 



16.2.1 Public Interest 
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Following the lead of Pigou (1920), the public interest theory sees the State as having 
a duty to solve market failures, those market equilibria that fail to satisfy the equi- 
marginal conditions for Pareto efficiency (cf. first welfare theorem in Mas-Collel et al. 
(1995) or §2.1.1). These failures can originate from externalities, public good features, 
information asymmetries and monopoly like behavior. For instance, the core mission of 
the State is to uphold the rule of law in order to guarantee the basic political and eco- 
nomical freedoms. This endeavor has the character of a public good or at least displays 
positive externalities. Since voluntary provision is plagued by the free riding, 12@ the 
leadership assumed by public authorities is warranted. 

According to the public interest view, the State ought to subsidize and support activ- 
ities displaying positive externalities (e.g., research, education, health) and to limit or 
tax those displaying negative externalities (e.g., monopoly pricing, pollution, noise, con- 



gestion). 13@ Asymmetric information, dealt with in Part H, is at the root of innumerable 
quantity and quality regulations such as construction (zoning, materials, codes), food 
(fertilizers, processing, hormones), retail (zoning, schedules, labels), transport (zoning, 
security, pollution, driving), schooling (certification, curriculum), health (diploma, licens- 
ing, drug authorization) and safety standards (cf also §9.1.2). Most items in the list are 
less than a century old and were devised alongside the general increase of welfare in 
advanced economies. It is therefore natural to asks whether this improvement of our life 
standards occurred thanks to regulation or despite of it. We agree with Shleifer (2005) 
that regulation, as we know it, is society's optimal response to deep market failures (cf. 
§17.1.1 on the origins of natural monopoly). As technology and tastes evolve, so does 
regulation (though too slowly). 

Although a market failure is a hard fact, the belief that the government can correct 
it at no cost indulges into what Demsetz (1969) calls a "nirvana fallacy"; in other words, 
a government failure is equally possible. 14 ® We now proceed to discuss this possibility 
with two complementary theories. 

16.2.2 Public Choice 

Public Choice brings the utilitarian and rational behavior assumed in economic theory to 
the study of politics: voters, politicians and bureaucrats are viewed as self-interested. 15@ 
With such a starting point, leaving no space to dedication, enlightenment and coopera- 
tion, the outcome is necessarily bleak: in equilibrium of the political game, all activity 
serves the political elite and its supporters. 16 ® The rent-seeking concept, studied thor- 
oughly in §16.3, is the main contribution of public choice to economic theory. 

A few important results are as follows. Downs (1957) shows that voters have few 
personal incentives to monitor government and politicians effectively because their vote 
or political activism weight so little. As a consequence, parliament is almost free to pass 
any law and the government almost free to pursue any policy. Olson (1965)'s theory of 
collective action abounds and further shows that special interest groups, with few mem- 
bers, have on the contrary maximal incentives to step-in the political arena. This tends 
to explain the prevalence of lobbies and pressure groups. Buchanan and Tullock (1962) 
show that the political decision-making game is one of bargaining among politicians who 
represent local special interests rather than their voters. 17@ In equilibrium, vote trading 
among politicians is observed to reach a majority and advance their particular objectives 
(see also quid-pro-quo or patronage, cronyism, spoils system or pork-barrel and logrolling 
in US parlance). This process is inefficient because most of these laws bring little local 
benefits but require additional distortionary taxation at the country level for financing. 



In the US, mandatory cost-benefit analysis is an intent to reign in this wasteful process. 

As far as industry regulation is concerned, we may say it is second-best efficient 
because of the constraints weighting on the political system. Although each regulation is 
initially designed to fix a market failure or carry a moral goal, it is transformed during 
the enactment process (to pass the majority threshold). As we show in §16.3, there is 
a lobbying cost involved. A second source of inefficiency is the "tollbooth" problem, a 
modern form of rent-planning, whereby politicians and bureaucrats use regulation to 
their own advantage (cf §16.2.4). First, they limit competition (instead of fostering it) to 
create rents and then they extract the rents from the regulated firms through campaign 
contributions, votes and bribes (cf. §16.3.2). 

The public choice literature agrees, in principle, to state intervention in order to solve 
market failures and finance public goods. Yet, it observes that the government tends 
to expand without bounds up to the point where it does more harm than good to the 
economy and the population. It thus advocates dealing with external economies and dis- 
economies through voluntary cooperation by setting up private organizations instead of 
governmental bureaus. 18@ Experience shows that such a construct works only for small 
communities. In larger settings, the private organization managing the voluntary co- 
operation of citizens has to establish rules to enforce cooperation and avoid free riding. 
Soon, it erects itself as a new (local) government. In other words, society cannot de- 
velop without government, whether at the central or local level. We must cope with the 
inevitable inefficiencies it will involve in the decision process. 

16.2.3 Regulatory Capture 

Whereas "Public Choice" focuses on political bargaining and the distortions it induces on 
the whole economy, "Regulatory Capture" is an offshoot focusing on regulated industries 
i.e., natural monopolies (cf. §17.1). 

Issue 

At the high point of regulation in the US during the 60s, many economists start to feel 
that the SCP paradigm (cf. §1.2) fails to adequately describe and model many US mar- 
kets, especially those regulated by federal laws. Stigler (1971) takes a strategic approach 
to regulated industries and introduces the Special Interest Group as an actor reciprocat- 
ing with the government, the consumers and other (rival) industries. He concludes that 
State interventionism serves special interests rather than the public interest. A clear 
example of this are the agricultural support programs prevalent in industrialized coun- 
tries. 19@ His policy recommendations are thus towards deregulation and "laissez-faire". 



The capture theory is re-reading of the standard Marshallian supply-demand anal- 
ysis using Olson (1965)'s theory of collective action to address group behavior. On the 
supply side, politicians who enact regulations and bureaucrats who implement them are 
responsive to pressure because politicians seek support in terms of vote or campaign 
money while bureaucrats seek cash or in-kind transfers or future job opportunities. On 
the demand side, an oligopolistic industry has either much to gain or to lose from regula- 
tion, hence its members are strongly motivated to unite into a cohesive industry associa- 
tion to lobby their regulator or the legislator. Whenever there is a small number of firms 
in the sector, the free rider problem characteristic of collective action is manageable and 
the association becomes a powerful lobby, likely to influence legislation (cf §2.4.4). The 
other main actor on the demand side is the general public comprising all the consumers 
of regulated goods and services. Since an individual consumer weights little over the 
final outcome, he has very low incentives to get informed about policy debates and act 
to obtain a favorable regulation. The free rider problem is then maximal for consumers; 
hence, their global influence over regulation is likely to be weak. 

These considerations lead to conclude that regulators are captured by those they are 
supposed to regulate. There are plenty of examples where regulation is captured dur- 
ing enactment or meanwhile it is applied but obviously never has it been created by an 
industry to protect itself from competition (as it was sometimes claimed). 20@ The mitiga- 
tion of regulatory capture is taken on in §16.3.4. 

To conclude, it must be observed that regulatory capture is not a recent perversion of 
the antitrust framework. Rather, as we show in the next section, it is the modern form 
of an age old agreement between the State and the industry to limit competition, create 
rents and share those. Rent-planning, as we call it, was one of the mercantilist policies 
reviled by Smith (1776). 

16.2.4 Rent Planning 

To better understand the functioning of markets, we often assume the perfect legal 
framework described in Chapter 8. The various rationalizations of the State seen up 
to now adhere to this conception and disagree only with respect to the ruling elite's ob- 
jective: common good or private interests. Yet, by ruling out political struggles and social 
conflicts, these theories fail to properly account for the recent phenomena of nationaliza- 
tion, liberalization, antitrust and rent-seeking. To make sense of these issues treated 
fully in the oncoming sections 16.3 and 16.4, we follow Acemoglu (2010) (ch. 4) and look 
back at the violent roots of State formation. 



The War Motive 

According to Tilly (1990), war is prevalent throughout the history of human kind be- 
cause elites, whether rulers, oligarchies or parliaments, display a strong attraction for 
prestige and empire building (much like CEOs of modern firms). 21 ® Paramount to suc- 
cessful dominance is military innovation which has two distinct consequences. On the 
one hand, innovations make combat more decisive which in turn leads to the consolida- 
tion of small territories into larger states. 22 ® On the other hand, war becomes ever more 
expensive, forcing rulers to seek revenue beyond the income of their personnel assets 
(mostly royal land). 23 ® The two main channels for this task are taxation and what we 
choose to call rent-planning, namely the monetization of privileges through the carteliza- 
tion of the economy and the subsequent sale of the afferent monopoly rights to private 
investors. For instance, it is well know that central banks were created to enhance the 
creditworthiness of states engaged in wars and allow them to issue more debt. What 
Broz (1998) shows is that the political support from the financial sector was won by 
cartelizing the sector. 

Our novel and neutral terminology emphasizes that these revenue generating prac- 
tices are carried on by all types of rulers from dictatorships to democratically elected par- 
liaments, going through oligarchies. 24@ Rent-planning is thus not directly determined 
by the recourse to violence or the absence of democracy 25 ® Obviously, in each case, the 
design of taxation and rent-planning reflects the size and scope of the elite ruling the 
State. Yet, the fundamental purpose remains the same which is to raise finance in order 
to further the objectives of this governing elite. 

As recalled in §16.1.2, taxation is the main financing instrument of advanced economies 
today, but this is a novel evolution. 26 ® Until the advent of the modern bureaucracy, it is 
fair to say that the administration is inefficient and corrupt which leads to low ratios 
of income to payments. To compensate, rates are set at very high levels and numerous 
taxes are created, which in turn leads to resentment, evasion and revolt. Tax revenue is 
thus endogenously limited and rent-planning can thus be understood as an alternative 
avenue to raise income that visibly hurts no-one and therefore passes relatively unno- 
ticed and unopposed. Rulers use rent-planning to bypass the limits to taxation set by 
their councils (ancestors of parliaments) whose agreement is necessary to change the 
tax code. 27 ® 

Planning Rent Extraction 

To extract the producer surplus of an economic sector, the State creates a private mono- 
poly or a cartel structure for the sector; it then auctions the afferent property rights to 



an entrepreneur or a clique against some compensation that is either (explicit) cash or 
(implicit) political support. 28 ® In this arrangement, the producing side uses its expertise 
and organization skills to efficiently exploit the envisioned economic monopoly and max- 
imize profits. The State uses his mastering at coercion to enforce the exclusive property 
rights of its partner from encroachment by unauthorized competitors whether they are 
nationals, aliens or even a foreign power contesting the monopoly. Since the fee paid to 
the State is independent of the amount of economic activity, the entrepreneur is made 
a residual claimant of all profits he might generate. His incentives to exploit the mar- 
ket, cut cost and innovate are thus maximized, in so far as the State does not renege on 
the initial agreement (and demands extra compensation). The rent-planning scheme is 
quite similar to the fixed price contract used nowadays for the delegation or concession 
of a regulated activity (cf §17.1.1 & §14.4) with the major difference that the scope of 
the contract is the entire economic sector, not just one unit or region or infrastructure. 

There is an inherent hold-up problem since the coercive power of the State allows it 
to renege at any time and extracts ex-post rents from the firm (cf. §14.2). The solution 
at a time where the rule of law does not bind the ruler is to invite the entrepreneurs to 
form a clique (e.g., guild) and enter the elite; this way, their property rights are more 
credibly upheld. This process explains the entry of the bourgeoisie into the elite which 
initially includes only public services providers (defense, justice, police, religion). These 
commercial new comers are however quite different since they have a dual role, being 
cartel members for wealth distribution purposes but also productive agents within the 
economy. This duality is at the root of the democratization process because these cliques 
have long lived investments to protect and have thus the greatest interest to impose the 
rule of law (cf. model in §3.3.3). 29 ® 

The most famous facet of rent-planning is that relating to trade policies known as 
mercantilism, 30 ® protectionism 31 ® and colonialism. 32 ® The general thrust is to protect 
an entire industry with import tariffs, export subsidies and citizenship restrictions (e.g., 
employment, transport, capital). 33 ® Whereas mercantilism emphasizes national monop- 
olies, protectionism vie for competition at the national level which is more conducive of 
efficiency and innovation. 34 ® The free-trade movement associated with the globalization 
of commerce finally aims at removing all barriers to trade. 

Inefficiency 

Rent-planning generates several forms of inefficiency that were clearly enunciated by 
Smith (1776). Firstly, cartelization leads to monopoly pricing, an allocative inefficiency 
(cf. §3.2). Nowadays, the reverse problem of excessive output occurs in the public services 
because price subsidization is used to buy political support from the masses. These 



diametrically opposed price distortions send equally wrong signals for the use of factors 
among industries which is a source of productive inefficiency (cf. §2.1.2) . A second 
source of waste recalled by Baumol (1990) is that entrepreneurship and innovation are 
hampered by the State's systematic appropriation of all new sources of rent; this is an 
instance of dynamic inefficiency 35 ® This encroachment also drives part of the economic 
activity towards the black-market where property rights are seldom enforceable, thereby 
hampering further innovation. 36 ® 

Last but not least, rent-planning is a source of long term institutional inefficiency 
because of the inherent dichotomy between innovators and administrators. Our claim 
is rooted in the observation that inventive entrepreneurs are new comers without link 
to the current elite. This is best understood by looking at their offsprings' occupational 
choices. Entrepreneurs are risk lovers because being penniless, they have nothing to 
lose. The offspring of successful entrepreneurs who have entered the elite are on the 
contrary risk averse because they are born within the elite. It is thus rational for them 
to pick an administrative or political job rather than continuing the risky business of 
their fathers. The ruling elite thus displays a dynastic character. 

Acemoglu (2008) explain the economic consequences of this dynamic as follows: when 
successful entrepreneurs seize political power, they pursue at first policies favorable to 
their trade which is efficient. Now, when a new technological wave starts, the relative 
prices of inputs and outputs change and this calls for a reorganization of economic ac- 
tivity. The ruling elite, only mastering the old technology, stands to lose in front of new 
comers who master the new arts; the elite then rationally blocks the necessary institu- 
tional changes, thanks to its dominion of political power. 37 ® Growth, which depends so 
much on innovation, is thus hampered by institutional inertia. The same basic obser- 
vation for industry sectors leads Schumpeter (1942) to enunciate his theory of creative 
destruction. As a matter of conclusion, one cannot fail to notice that countries were rent- 
planning is paramount have failed to introduce anything useful to the human kind over 
the last century 38 ® 

Legacy 

Section 16.3 on Rent-Seeking studies influence (lobbying) and corruption that can be 
seen as agreements between a clique (aka Special Interest Group) and the State rep- 
resented by either the legislative or judicial bodies, the central or local government or 
a bureaucrat. This concept is the natural evolution of Rent-Planning within advanced 
industrialized democratic countries. 39 ® There are two noticeable differences. Firstly, 
rent-seeking as the name indicates is at the initiative of the rent-seeker and secondly, 
protection is not based on violence (or menace) but on an abuse of the legal system to 



create (and enforce) undue property rights (but not necessarily unlawful). 

Section 16.4 on liberalization looks at nationalization, privatization and deregulation. 
Nationalization (cf §16.4.2) involves State planning but without the rent generation 
motive. Historically, it is prevalent in crisis times when the government is forced to 
bailed out large firms to avoid further damage to the economy (e.g., banks, railways, 
airlines). Otherwise it follows ideology and, above all, the desire to control the economy 
but not to extract rents as in our original argument. The recent wave of privatization (cf. 
§16.4.3) is a modern case of rent-planning. Indeed, empirical studies have shown that the 
major motive was to refloat State finances rather than to improve the competitiveness 
of markets (at least not in the short term). 40@ Lastly, deregulation (cf. §16.4.4) aims 
at eliminating rent-planning practices that have survived in most regulated activities 
among which natural monopolies. 

16.3 Rent-seeking 

The topic of this section is best illustrated by Pareto (1906): "A protectionist measure 
provides large benefits to a small number of people, and causes a very great number of 
consumers a slight loss. This circumstance makes it easier to put it into practice." 

Advanced economies have more or less succeeded to eliminate violence and most ille- 
gal behaviors but this does not mean that all human economic activity is geared toward 
production. The presence of the State as a protector and a potential oppressor gives 
rise to a host of wasteful conducts such as corruption, red-tape, log-rolling, lobbying or 
cronyism that we cluster under the name of rent-seeking. The strategic analysis is un- 
dertaken in §7 on contest and conflict. We shall look here at the roots of the phenomenon, 
distinguish between corruption and lobbying, reflect on the inefficiencies generated by 
rent-seekers before looking at some remedies. 

16.3.1 Tipology 

Non productive behavior can be geared at the State or at competitors and goes under 
the respective names of rent-seeking and profit-seeking. Profit-seeking through legal 
means is analyzed in §7 on contest and conflict while deceptive competition is thoroughly 
studied in Part D on antitrust. For instance, entry deterrence is a legal defense where the 
incumbent invests excessively into production capacities, advertising or accumulation of 
intellectual property rights (IPR) such as patents or trademarks (cf. §12) to discourage 
entry on its protected market. Predation, foreclosure and restrictive agreements are 
anti-competitive (illegal) defensive strategies. Collusion is the artificial creation of a 



monopoly to increase industry profits. 

Most rent-seeking originates from the State decisions, voluntarily or not. Any regu- 
lation imposed for general quality purposes (e.g., security, hygiene or standards) neces- 
sarily bestows an advantage upon the firms that have passed the regulatory control as 
opposed to those who haven't (yet). Being in limited number, often a monopoly, license 
holders are shielded from standard competition and thus earn economic rents. This cir- 
cumstance makes the license valuable for an incumbent who fears losing it as well as for 
a potential entrant eager to grab a share of the cake. Incumbents, as a group, also vie 
to block entry by asking the State for a Numerus Clausus policy to maintain per capita 
profitability 41 ® Challengers, on the other hand, seek the removal of entry limitations to 
become new incumbents. 

Likewise, wealth redistribution programs increase the recipients' utility (at the ex- 
pense of the entire economy). The insiders enjoying the situation fear to lose while out- 
siders would like to come on board. In the same class of interactions, we find import 
tariffs and quotas for international trade, minimum and maximum prices for market 
control, taxes and subsidies for specific groups, be it an industry, a geographic area or 
a demographic class. One feature that makes rent-seeking a lasting entrenched phe- 
nomenon is loss aversion, a deep trait of human nature whereby one is ready to suffer 
much more to defend a possession than to conquer a new one. 42@ When a group benefits 
from a privilege or subsidy for some period of time, it starts to take it for granted. Once 
this belief is firmly held, the group will expand no few resources to defend it so that the 
State often prefers to ratify it as a right which then become inexpugnable. 

Natural monopolies are more subtle to interpret. If the regulation is ideally adminis- 
tered, it leaves no rent to the incumbent. The insiders benefiting from the regulation are 
then the consumers (who will insist on low prices) while the outsiders are the current 
and potential providers of the service who crave for a softening the regulation (a price 
raise). If the regulation is laxly applied, the roles are reversed: the incumbent seeks 
the status-quo, consumers seek a price freeze and contenders seek a change of provider. 
Lastly, a public monopoly (e.g., municipal agency) is vulnerable to internal rent-seeking 
because the bureaucrats in charge may distort the objectives of public service towards 
their own; it is however invulnerable to external rent-seeking in the short run, but might 
become a target if the government decides to privatize the service. 

In all cases, there are at least two contenders looking for an actual or potential rent 
(possibly with different personal valuations) that is decided upon by a State representa- 
tive, most often a committee. Hence, the study of rent-seeking as a strategic interaction 
in §7.2 requires to treat voters, politicians and bureaucrats as rational decision makers 
much like firms and consumer groups (cf public choice theory in §16.2.2). 



16.3.2 Corruption and Lobbying 

Just like market competition relies on price and non-price strategies, rent-seeking dis- 
plays a similarly dichotomy with corruption and lobby. As we shall see, the former is a 
more efficient channel but as it is often prohibited or costly to use, the second one has 
gained prominence. 

Corruption 

Svensson (2005) defines it as any abuse of entrusted power for private gain (cf Trans- 
parency International). One speaks of bribing when it involves a cash transfer. The 
recipient may be a bureaucrat, a regulator, a legislator, a manager (within a firm or any 
organization). The briber is any stakeholder in the decision process such as a firm in a 
procurement contest or in a regulatory evaluation, a lower level manager in labor tour- 
nament or anyone involved in a legal issue. Bribing is seen as disloyal competition under 
most legal systems and so is declared illegal per-se. This obviously limits the ability to 
use that instrument in environments where the law prevails. 

Corruption thus turns to alternative (and more costly) channels to transfer utility to- 
wards the targeted decision makers such as in-kind gifts (e.g., lunch, travel or club mem- 
bership) or legal spending in activities (e.g., charitable foundation) that bestow prestige 
onto the prey. Finally, the promise to give the bureaucrat or politician a lucrative job in 
the future can be used to transfer wealth inter-temporally Many countries try to limit 
the effectiveness of this artifice by imposing to its civil servant (and alike) a waiting pe- 
riod before one is able to work in the industry he previously regulated. It is often quite 
easy to go around the limitation (e.g., being employed by a subsidiary of the company 
active in an altogether different sector of the economy). 

In the cases that matter for industrial organization such as a telephone license or 
an import quota, the stakes are very large in comparison with the maximum bribe that 
can be realistically transferred. Indeed, bureaucrats (or anyone with a public profile) 
in advanced economies are neither poor nor rich, thus they can't change their lifestyle 
without being noticed and get into legal trouble. This puts an upper limit on the graft 
money they can accept whatever channel it comes through. To conclude, we offer Frye 
and Shleifer (1997)'s typology of corruption in governments. 
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Table 16.3: Levels of Corruption 



In advanced economies where the legal system is strong enough to effectively limit cor- 
ruption, the ubiquitous behavior to seek rent is lobbying whereby cohesive minorities vie 
to influence decision makers by pressuring them. The "friendly" pressure is the exposi- 
tion of one's arguments in favor of a particular course of action. For instance, a regulated 
monopoly asks for a price increase arguing that inputs have become more costly while 
the consumer group asks for a price freeze arguing that the incumbent makes fat profits. 
A potential entrant asks for a license to operate in a "numerus clausus" market such 
as taxicab or hertzian TV arguing that this will increase the supply quantitatively and 
qualitatively. Conversely, the association of architects asks for a limit on the number of 
degrees delivered by universities to maintain the highest quality (and avoid cut-throat 
competition). 

The complementary channel of influence for lobbying, as in "carrot and stick", is hos- 
tile pressure such as (unwarranted) judicial harassment, public protests, strikes or nega- 
tive media exposure (smear campaigns). This method lies at the frontier of legality since 
it often recurs to coercion and violence (cf §7.3.1 for a model of such behavior). 

The Lobby sector is said to employ some 15000 people in Brussels where most Eu- 
ropean institutions sit and twice as many in Washington DC in the US. The consensus 
based approach to EU policy-making and lobbying is often contrasted with the profes- 
sional and more aggressive US lobbying style. Reasons for these differences are to be 
found in the well established tradition of lobbying in the US that has attracted all so- 
ciety stake-holders; we find at the same time professional associations defending the 
strengthening of copyright laws and those defending the strengthening of innovation. 
The relative smallness of the European institutions in comparison with the US federal 
equivalents and the newness of lobbying in Europe makes it a preferred target of indus- 
tries representatives whose inputs are welcomed by authorities to frame new policies. 43 ® 



16.3.3 Inefficiency 

It is well known that when rational agents (firms, consumers, countries) struggle over 
a given prize, they expand resources to improve their bargaining position or fighting 
ability. If they could agree ex-ante to an exogenous redistribution, they would save the 
aforementioned resources. By failing to settle their dispute and engaging in actual con- 
flict, they generate a welfare loss. 44@ Tullock (1967), Krueger (1974) and Posner (1975) 
start to inquire this issue theoretically and empirically for publicly awarded rents (e.g., 
natural monopolies, protectionism, licensed occupations). 

These authors argue that if there is perfect competition in the rent-seeking activity 
(including free entry) then average benefit equals average cost in the long run i.e., the 
entire benefit, the rent, is equal to the entire cost, the resources expanded to secure the 
rent. In that extreme situation, the rent is fully dissipated but this does not mean that 
it is entirely lost in terms of welfare. Empirical studies find welfare losses amounting to 
a few percentage point of GNP. Though not as large as what the initial literature lead to 
believe, this loss is however far greater than the traditional dead-weight loss due to the 
exercise of market power. As we show analytically in §7.2.1, many elements concur to 
reduce the rent dissipation from such a high level. Let us review the arguments exposed 
by Fisher (1985). 

When, as in most cases, the license is to be renewed, the incumbent holds a consid- 
erable advantage. He knows better the demand and the cost, thus faces less risk which 
makes him a stronger rent-seeker. Furthermore, he holds up the regulator so that re- 
newal is almost automatic (cf §17.3.3) These features imply that contenders won't spend 
much effort to secure the monopoly since they mostly stand to lose; rent dissipation is 
then limited. By reversal of the previous arguments, we can expect the greatest amount 
of lobbying for the awarding of a new monopoly license. Yet a countervailing effect is the 
uncertainty of the rent; rent-seekers will tune their effort to the certainty equivalent of 
the rent which amounts to subtract a risk premium from the expected rent, hence the 
waste is once again reduced. 

With respect to private non regulated quasi monopolies (e.g., Microsoft), it must also 
be noted that many monopolies are obtained through luck rather than foresight i.e., as 
a consequence of technological changes or the evolution of customs. If the rent holder is 
a lottery winner, then the optimal amount of rent-seeking effort is zero and there is no 
social waste. The learning curve is also a natural barrier to entry that protects some of 
the monopoly rent. Lastly, one can draw an analogy with traditional rent to express the 
idea that a monopoly rent may not be contestable and hence will not be socially wasted 
(cf. §6.1.7). Under perfect competition, it is possible for a firm to earn a rent if she owns 
an asset of value such as management talent or good location. This does not contradict 



the fact that in the long run, her average cost is equal to the market price because the 
rent derived from the asset is incorporated into the firm's cost as an opportunity cost. 
The same phenomenon occurs for an incumbent monopoly for whom the valuable asset 
is the set of barriers to entry that he has come to enjoy (independently of how these 
barriers were erected). 

16.3.4 Remedies 

Since rent-seeking is created by the State in the first place, one simple way to eradi- 
cate this wasteful activity would be to scale down the State but this is often impossible 
because the rent appears as a by-product of the State's primary missions (cf §16.1.1). 
However, in some sectors it has been possible to "deregulate" and insulate the regula- 
tory agencies from pressure groups (whoever they are). The other remedy is to turn the 
spontaneous and wasteful rent-seeking game into an ordered welfare enhancing contest 
(cf. §7) that will select the most efficient firm and regulate it in a second-best fashion (cf. 
§17.2). The mechanism most commonly used for that task is an auction because it has 
the added feature of generating public income (cf. §22). 45@ Auctions are also used to sell 
occupational licenses for professions such as taxi-cab or architects. Under an auction, the 
identity of the winner does not depend on lobbying or bribing but solely on the amount 
pledged, hence money is best used in the bid and there is no wasteful rent-seeking. 46@ 
A licensee has also the right incentives afterwards to run his business efficiently (cost 
cutting, innovation) since the scheme is formally equivalent to a fixed price regulation. 
For those public services that come in limited supply and who cannot be auctioned due 
to equity considerations, the sole "rent-proof" allocation mechanism, that is also fair, is 
random allotment (e.g., kindergarten or social housing). It however suffers either from 
transaction cost in the re-trade market or from irremediable inefficiency when re-trade 
is prohibited (cf. §22.1.2). 

The case of industries or sectors displaying scale economies (e.g., natural monopoly) 
calls for a different treatment (cf. §17.2).) For the numerous cases where competition in 
the market is likely to result in monopolization (or cartelization) and direct governmental 
regulation seems unsatisfactory, Demsetz (1968) proposes 47 ® to switch to competition for 
the market i.e., franchise bidding. Initially, each contender is willing to bid up to the full 
monopoly profit for a license to exploit the market at will. This, as we know, is not welfare 
maximizing given that output will be restricted. The (constrained efficient) solution is 
to require a minimum level of quality and let candidates bid for the lowest customer 
price. Obviously, the regulation must provide with high penalties for failure to meet 
quality objectives and the provision of enough collateral by the firm to avoid ex-post non 



compliance (cf. §13.3.3 on the hold-up). 48@ If the best technology is available to all firms 
in the industry, the winner earns no rent since the Bertrand type competition inherent 
in auctions drives the customer price down to the average cost. The license holder keeps 
the right incentives to cut cost and innovate as before. 

Goldberg (1976) shows franchise bidding still suffers from capture as it relies on 
"cheap talk". Indeed, the awarding being based on a comparison of tentative figures, 
it is tempting to bid aggressively in order to win and then force a renegotiation arguing 
(falsely) for some "force majeure" event that has unexpectedly raised costs. The regu- 
lator faced with this claim would reduce welfare if he was to dismiss the firm and run 
the scheme a second time; on top of this, it would be an admission of its own failure 
which is the worst possible outcome for the agency's employees. The auction for the 
franchises of water distribution in French cities admired by Chadwick (1859) is a fine 
example of an efficient regulation gone captured by its clients. The historical providers 
have grown into three large utilities that currently share the national market on a geo- 
graphical basis; the situation is quite similar to a cartel. According to a 2001 report by 
the national assembly, franchises are not auctioned anymore and are the object of collu- 
sive agreements between firms and municipalities to unduly tax citizens (city expenses 
are channeled into the water bill). 49@ 

16.4 Liberalization 

Chapter 17 is entirely devoted to the regulation of natural monopolies and other indus- 
tries deemed crucial by the State. In the present section, we relate the liberalization 
process of the last decades. The term is generic and clusters two approaches geared at 
the specificities of the European and US traditions, namely privatization 50 ® and dereg- 
ulation. 

Liberalization tries to establish a light handed regulation over sectors where per- 
fect competition is not viable overall but where some segments can be made competi- 
tive while other, with natural monopoly features, remain regulated. As argued early 
on by Posner (1969), effective challenge but also potential competition is what prohibits 
a monopoly from abusing his dominant position. This is why, in theory, liberalization 
ought primarily to foster entry by reducing barriers (cf. §6.1.7) and, secondarily, improve 
direct regulation with incentives (cf. §17.3). 

Before studying the liberalization trends, it is worthwhile to take a look at the past 
and see when and why the State decided to take an active or passive role in the produc- 
tive sectors of the economy, either by delegating tasks or by controlling them closely. 



16.4.1 Grants and Concessions 

To raise revenue, the State sells specific rights or or delegates special tasks to private 
individuals or local governments, thus taking a passive role towards economic activity. 
Historical examples abound where the ruler (or parliament) uses his privileges to issue 
a letters patent granting (or leasing) an item such as a piece of land, an activity to be 
performed over royal land (e.g., fishing, hunting), an office (e.g., notary), a title, a pen- 
sion, a patent for an invention, a right to hold market or fair, an exclusive right to some 
activity over some territory (e.g., mill wheat, work wool, operate a lottery), a pardon, a 
sinecure or a judicial commission. 51 ® Another important legal act is the municipal char- 
ter awarded (i.e., sold) to a city in order to free its inhabitants from military service and 
acquire monopoly rights over some trade or industry (which still weights on the local 
economy centuries later). Even taxation can be delegated (outsourced) for a yearly fee 
(cf §16.2.4). 52@ In some cases, the tax base (e.g., city, county) is rich enough to buy out 
the perpetual right, turning the transaction into a privatization. 

The provision of public services and goods is also delegated to the private sector, 
especially when it involves large investments or requires specialized knowledge. The 
instrument is called concession whose latin meaning is "put in place of" (the prince). 
Since the rent extraction motive is less present in those cases, financing through vol- 
untarily agreed taxation is generally observed. As recalled Bezancon (2005), 53@ state 
delegation through concession develops in stages throughout history. In Roman times, 
delegation pertains to public works such as thermal baths, market places, roads, ports 
or aqueducts. Private parties also run services of public interest such as the post office, 
tax collection and money minting. Concessions for public services and infrastructures, 
often proposed by private parties, abound. From the XV th til the XVII th centuries, new 
delegations include post coach, stage coach, marsh draining, mining, canals, roads and 
bridges. In the XVIII th century, waste collection, street pavement and even city districts 
construction or renovation are offered in concession. Contracts are refined and include a 
10 years maintenance obligation for contractors. In the next century appear concessions 
for water distribution, street lightning, railways, gas distribution, streetcars, telephone, 
electricity, subway and airways. This topic is further elaborated in §17.1 on the natural 
monopoly. 

Governments have now at their disposal a large spectrum of options for ownership 
and operation of public utilities. Table 16.4 presents the World Bank's glossary on public 
private initiatives (PPI). 

Public Enterprise In-house production 

Outsourcing Short term contract (< 3 years) involving no commitment for the firm and no or 
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Table 16.4: Spectrum of Concession and Franchise 

low risk. 

• Input supply: procurement of raw materials 

• Facility Maintenance: take advantage of economies of scope and scale to maintain build- 
ing and other standard assets. 

• Technical assistance: take advantage of the specialization of consulting firms to develop 
IT or management systems. 

• Sub-contracting: take advantage of economies of scope and scale to delegate a standard- 
ized task such as billing to an outsider. 

Management contract Short term without strategic risk (insulation from politics) but some 

commercial (demand related) risk. 
Delegation contract (concession) The government sets the duration, rights, obligations and 

tariff, the latter defining implicitly the degree of risk for the firm. Asset ownership remains 

public but the firm maintains them. 

• Lease: term = 5 years, no investment, commercial risk and freedom of action. Revenue is 
either fixed or depends on performance. 

• Franchise: term = 15 years, new investments, commercial risk and limited freedom of 
action. The franchise is generally awarded to the highest bid which can be a once-for-all 
fee (bonus), 54@ a royalty (percentage of future revenues) above a minimum threshold set by 
the franchiser or a share of future profits (cost need be audited). 

• Concession: term = 30 years, large investment in either greenfield (aka Build-Own- 
Transfer contract) or rehabilitation. 

Build-Own-Operate is an example of Public Private Partnership (PPP). A design-build-maintain 
(DBM) with or without operation creates a lifecycle responsibility and thus provides an in- 
centive to deliver better quality in the initial design and construction of the project because 
the firm will have to bear any additional maintenance and repair costs if the initial quality 



is inadequate. 
Divesture The government sells to the private sector the right to provide the service. 

• License: limits entry for activities with network externalities. Heavy handed regulation 
remains. 

• Sale (privatization): the light handed regulation and freedom of entry are made possible 
by technological developments that have eliminated the historic element of natural mono- 
poly. 

Private market The service is provided by unregulated firms freely competing. 

16.4.2 Nationalization and Municipalization 

In this section, the State takes an active role in the economy. Nationalization occurs 
when the State buys a private firm and turns it into a State Owned Enterprise (SOE). 55@ 
This relatively recent (about a century old) phenomenon is prevalent in crisis times when 
the State is forced to bailed out large firms to avoid further damage to the economy (e.g., 
railways and airlines). Otherwise there is an ideologic motive to control the economy 
and steer its development. 

The direct involvement of the State with productive businesses is linked to the in- 
dustrial revolution. New services like sanitation, light, heat or communications require 
heavy network investments to cover large geographical areas (e.g., an entire country). 
With the exception of the US and UK, national financial markets are underdeveloped 
and unable to sustain the required effort so that public finance and thus ownership 
comes into the picture. At the city level, the desire to avoid anarchical duplication of 
networks, to guarantee a high level of quality and to eliminate corruption, makes direct 
municipal oversight unavoidable. Either a municipal agency is created or early private 
firms are bought out (municipalization). Finally, in heavies industries such as mining, 
chemicals or petroleum, the choice made by the State of building SOEs, either directly 
or by purchasing private firms responds to a desire for modernization and growth pro- 
motion. 

Political ideology also plays a significant role for the nationalization push during the 
last century. Most of the activities under concession during the liberal era are either na- 
tionalized or municipalized by leftist governments as a mean to redistribute wealth from 
the capitalist owners, the bourgeoisie, towards the less affluent classes of society. Like- 
wise, authoritarian governments (e.g., soviet and fascist) being ideologically prone to di- 
rect control, nationalize large parts of the economy to advance their particular goals. 56@ 
The result is a State owned economy in soviet countries and a State owned conglomerate 
of industrial firms in fascist countries like Italy or Spain. 

In the aftermath of the second world war, most Western European countries (even 



the UK) nationalize large parts of their economies to accelerate reconstruction, apply 
long-term planning and remain master of their economic development (cf the national 
interest concept). The last major episode of nationalization in advanced economies takes 
place in France in 1982 but is overturned from 1986 on. Lately, some latin american 
countries have started to (re)nationalize industries that were privatized in the 1990's 
arguing they failed to deliver on their promises. 

Nationalization is unknown in the US where the private ownership of assets has 
never been called into question. Apart from the Postal Service or the Tennessee Valley 
Authority (power plants), there are hardly any federally owned firms. However, a quar- 
ter of municipalities own electric distribution companies and most public transit systems 
(tubes and buses) are municipally owned. 

16.4.3 Privatization 

The first modern privatization is Germany's carmaker Volkswagen in 1961, followed by 
the power and mining company VEBA in 1965 (now E.ON). 57@ The first wave of priva- 
tizations was launched in Chile between 1974 and 1978; two hundred SOEs were sold 
to private investors that later united them into six large conglomerates (cf. Yotopoulos 
(1989)). 

However, the big push for privatization in the advanced economies is the worsening 
of public finances starting in the 1970s. 58@ Concomitantly, SOEs suffering from under- 
investment and over-employment make losses; privatization thus appears as a remedy 
for two ills: improving the efficiency of these firms and refloat the treasury. The pub- 
lic finance motive that was at the origin of concessions is again at the heart of many 
privatizations and the fashionable use of concessions today. Large scale privatization is 
associated with the Thatcher UK government of the 1980s (and 1990s under John Ma- 
jor). The popular success of the British Telecom share issue privatization (SIP) in 1984 
can be seen as the starting point of a worldwide wave of privatizations. In the UK, SOEs 
amounted to 10% of GDP and to 8% of employment in 1979; these figures were cut by 
privatization down to 5% and 3% in 1992 and virtually zero in 1998. In France, SOEs 
reached a maximum of 9% of the work force in 1985 but have since fallen down to 3% in 
2004. 

The goals of privatizations are numerous: raise revenue for the State, promote wider 
share ownership among the public, promote economic efficiency and competitiveness. 
The later objective is supposedly achieved by reducing government interference in the 
economy and by putting privatized firms under market discipline. All privatizations 
of natural monopolies go hand in hand with the enactment of a law establishing the 



regulatory framework for the sector; it applies to the newly privatized firm but also to 
challengers whose entry is to be facilitated. 59@ A positive byproduct of privatizations has 
been the development of financial markets since large amount of money were turned into 
equity. Figure 16.1 and Table 16.5 taken from the Privatization Barometer display the 
trend of privatization in Europe over the last two decades (PS: private sale, PO: public 
offering). 
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Figure 16.1: Privatization trends in Europe 
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Table 16.5: Privatized European Industries 



According to Vickers and Yarrow (1988), the first wave of UK privatization is politi- 
cally motivated; the underpricing of shares aims at attracting votes from median income 
households. Tight regulation guaranteeing low prices and high quality only came with 
the second wave (after 1987). On the positive side, Megginson and Netter (2001) report a 
general empirical agreement over the fact that private firms are productively more effi- 



cient than SOEs. On the negative side, their analysis suggest that governments balance 
competing economic, political, and financial objectives when privatizing a SOE, so that 
economic efficiency is only marginally enhanced. Most privatizations appear to be the 
outcome of a rational alliance between management and government to create a pure 
private monopoly out of a public monopoly. The structure of the firm is left untouched 
i.e., there is no separation of activities or dismantling; if necessary, the accounts are 
sanitized before presenting the firm to international markets. 60@ Lastly, the statutory 
protections that shielded the public firm from competition are more or less maintained 
in the privatization act on grounds of maintaining productive efficiency (the natural 
monopoly property of technology). All of this means that the private firm will enjoy 
rents in the future, so that its shares are highly valuable which translates into a large 
revenue for the State. To avoid internal opposition from unions, employees get shares 
at a preferential price. For the management, the outcome means more freedom and less 
supervision by the regulator (e.g., treasury or public utility commission) and also avoids 
falling into the iron hand of a majority shareholder since the shares are either sold to 
the public at large or in small batches to institutional investors. This way, the board can 
vote itself large salary increases, generous stock options and other assorted perks. This 
analysis of the recent privatization process conforms with our view of the State as a rent 
planner (cf §16.2.4). 

It seems that competition (and its benefits) has developed in those sectors where a 
European directive has forced the opening of national markets, turning all the quasi mo- 
nopolies (so called national champions) into oligopolistic contenders inside a European 
league. This pro-entry stance, advocated by Posner (1969), is also at the root of the US 
deregulations (cf. Megginson (2007) for a recent overview). 

16.4.4 Deregulation 

The term refers mostly to the US whose public service providers are mostly private but 
were tightly regulated; deregulation then means introducing market competition in seg- 
ments where technological and demand characteristics makes it worthwhile and change 
the regulation of natural monopoly segments towards a lighter mode giving better incen- 
tives to cut cost and improve quality of service. The US deregulation started during the 
1970s under the combined influences of macro-economic shocks and the academic litera- 
ture of the Chicago school of thought. In 1977, 17% of US GDP was produced by tightly 
regulated firms, a figure that was reduced to 6% after twelve years of intense deregula- 
tion in the areas of telecommunications, financial services, airlines, natural gas, oil, road 
shipping, railroads, banking or cable TV. 



In Europe and elsewhere, regulation was designed from scratch for the formerly SOEs 
that were privatized. Most public services necessitate an infrastructure (e.g., network) 
which is essential in the sense of having the natural monopoly property. The EC has thus 
decided to maintain regulation on that segment only. The upstream and downstream 
segment, most importantly retail, are open to competition; the infrastructure owner is 
forced to give access to competing retailers on terms of cost and not lost profits. This 
principle has been applied for the liberalization of telecommunications, energy, transport 
and postal services. 

One clear example of successful deregulation is air transport where for decades, na- 
tional airlines had a monopoly over domestic routes and shared in a cartel like manner 
international routes through the I ATA association; cabotage (right to operate within the 
domestic borders of another country) was prohibited, fares were high and many airports 
in regional capitals were underused. Deregulation started in the US 61@ with the Airline 
Deregulation Act and the International Air Transportation Competition Act (1979); Eu- 
rope followed suit in 1997. Data from the US Bureau of Transportation Statistics show 
that the legacy airlines operating extensive national and international routes enjoyed in 
1990 an 80% market share over domestic US routes which came down to 50% in 2006. 
Figure 16.2 based on data from Eurocontrol show a similar trend for Europe. 
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Figure 16.2: Market Share of Low-Cost Airlines in Europe 



According to Morrison and Winston (2000)'s data on US airline prices regarding the 
1978-98 period, deregulation brought a 25% fare reduction with respect to a regulated 
price that would take into account productivity increases. Since differential pricing has 
been permitted, segments with inelastic demand pays more than under regulation but 
overall, 80% of the passengers enjoyed fare reductions. Figure 16.3 illustrates this find- 
ing. 
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Figure 16.3: Average airfare per kilometer 



Chapter 17 
Regulation 



Etymologically, to "govern" is to rule a country but it also means to steer a boat. This 
second meaning makes clear that the ruler is not supposed to row or, in the language 
of economics, that the government ought not participate directly in the economy. Reg- 
ulation then, is that branch of public action that specifically aims to pilot the economy 
towards the fulfillment of social objectives. The previous chapter, dedicated to the State, 
has explained more thoroughly why there are social objectives and what is their nature. 
In the present chapter, we study the regulation of network industries and natural 
monopolies from an analytical point of view. In the first section, we explain why some 
activities are regulated. We then analyze the objective and constraints for an ideal reg- 
ulation; we thus obtain an ideal level of revenues for the regulated firm and an ideal 
price structure for the goods and services it produces. In the third section, we study 
how practical regulation evolved from cost based norms to incentive schemes and what 
constraints limit their effectiveness. 

17.1 Why Regulate 

17.1.1 Context 

Transport (roads, canals, trains, airlines), telecommunications (radio, telephone, TV, in- 
ternet) and utilities (electricity, gas, water, sewage) are natural monopolies i.e., indus- 
tries characterized by very high sunk costs (cf §2.1.3), relatively low marginal costs and 
increasing returns to scale (IRS) due to network externalities (cf. §25). For these sectors 
but also many other capital intensive activities typical of the modern era (e.g., chemicals, 
metals, oil, cars), the average cost of a firm is decreasing with total production (over a 
large range of output). As we recall in §2.1.3, cost minimization requires the pooling 
of production which is typically achieved by the integration of independent firms into a 
small number of large players. It is therefore no surprise that the development of these 



activities at the end of the XIX th century lead quickly to local (in a geographic sense) or 
even national monopolies in all industrializing countries. Nowadays, these sectors are 
oligopolies i.e., competition, if any, takes place among few firms. 

The economic issue at stake with this new market structure characterized by "com- 
petition among the few" is market power, the low degree of competitiveness and the 
allocative inefficiency it may generate. Recall indeed from the theory of monopoly (cf 
§3.2) and oligopoly (cf. §5.1.3), that the fewer the competitors, the higher the prices and 
the lower the welfare. The dilemma posed by natural monopolies is that productive effi- 
ciency is achieved with a small number of firms (possibly one) while allocative efficiency 
calls for many contenders. We now recall how and why the State became closely involved 
in the management and regulation of natural monopolies. 

Genesis of "Natural Monopoly" 

Mosca (2008) recalls how the concept gradually evolved until its current day acceptation. 
Smith (1776) speaks upon a supra-competitive rent for high quality land citing "some 
vineyards in France (Bordeaux) of a peculiarly happy soil and situation" but does not re- 
late this fact to the idea of monopoly. Malthus (1815), writing specifically on rent, intro- 
duces the dichotomy between natural and artificial monopolies i.e., arising either from 
nature's gift or from the ruler's benevolence (cf. §16.2.4 & §3.1.1). Like all economists of 
his time, he loathes the second and lauds the first, thus acquiescing to the extraordinary 
rents they produce (cf. §8.2.4). 

Mill (1848) extends the definition further to include scarce skilled labour but more 
importantly adds a new category, the "practical monopoly" as arising from superior tech- 
nology, capital requirements or cartelization. The first case, concretely canals and net- 
works for water, gas and rail, is what is now understand as a natural monopoly. His 
stance is normative when stating "how great an economy of labour would be obtained 
if London were supplied by a single gas or water company instead of the existing plu- 
rality". Without being explicit, he seems to fear an abusive monopoly and thus calls for 
public oversight but oscillates between concession (cf. §16.4.1) and municipalization (cf. 
§16.4.2) without much of an explanation. 

Cournot (1838) is the first author to take a descriptive approach to natural monopoly; 
he concludes that (de-facto) monopoly is the only stable market structure, because she 
enjoys economies of scale over the entire range of market demand (cf. §10.1 on entry bar- 
riers).^ Dupuit (1852) also believes this outcome ineluctable 2@ for railways and ascribes 
it to scale economies, capital requirement and incumbent advantage (better geographi- 
cal sitting of the network) which prohibits an entrant from recouping his sunk cost. As 
recalled by Numa (2011), Dupuit advocates regulation as a second-best policy since he 



believes the government failure (bureaucratic cost) to be a lesser evil than the market 
failure (monopoly pricing). Walras (1875) first uses the vocable "natural monopoly" in its 
modern sense and, more importantly explicitly treats railways as a public service, thus 
calling for State management (cf. §16.1.1). 3@ Observing that rights-of-way for building 
networks can only be granted to a few firms, he concludes that these end up merging or 
colluding, thereby producing a monopoly 4 ® 

Edgeworth (1913), without using the actual vocabulary, offers the first analytical 
treatment of the natural monopoly. Sraffa (1926), criticizing the logical foundations of 
Marshall (1890)'s theory of the firm, argues that increasing returns to scale are incom- 
patible with perfect competition (so that oligopoly or monopoly is likely to appear). Oddly 
enough, the ensuing "cost controversy" remained purely theoretical and never mentioned 
natural monopoly as the archetypal case. 

Rationalization 

A definition devoid of ambiguities for "natural monopoly" would be "an industry where 
the socially optimal number of firms is one"; obviously it is not very instructive. We shall 
thus review how different schools of thought approach the natural monopoly and explain 
the ubiquitous presence of regulation. 

Evil Monopoly ? The public interest theory of the State (cf. §16.2) has adopted the 
historical narrative and holds that an unregulated natural monopoly evolves from cut- 
throat competition to monopoly. Since both market structures are inefficient, the former 
because of price instability and looming financial crisis, the latter because of monopoly 
pricing, the case for intervention is clear in order to stabilize the sector (allow sustained 
growth) and avoid its monopolization. 

Regarding the natural tendency towards monopoly, most economists agree that mono- 
poly is inevitable but it is not per-se condemnable, only its abuse is (cf. §8.2.4). For in- 
stance, monopoly rents over land or minerals are hardly ever opposed on moral grounds 
as they participate to an efficient management of a scarce resources (cf. depletion in 
§18.1.2). Posner (1969) argues that rents arising from advanced technology (e.g., iPod) or 
commercial wit (e.g., Microsoft Windows) should receive the same treatment. Recall that 
if society aims at fostering innovation and new wealth creation, it must promise supra- 
competitive profits to lure entrepreneurs into such risky activities. Monopoly achieved 
through business excellence is thus commendable rather than reprehensible. 5@ 

Hence, the only justification for regulating a network activity is to classify it as a 
public service: monopoly pricing would be abusive because society desires an extended 
level of consumption or universal access. 6 ® 



Evil Regulation ? The public choice theory of the State holds an opposite rent-seeking 
view: natural monopoly industries, fearing endless cutthroat competition, negotiate pro- 
tection with authorities (cf §16.3 & §16.2.4). At the same time, there are grounds to 
believe that Schumpeter (1942)'s "creative destruction" is at work in natural monopo- 
lies. 7@ If so, cutthroat competition is healthy as it fosters innovation and cost cutting. 
It is then a source of dynamic efficiency. Next, monopolies being of a temporary nature, 
they generate only a small allocative efficiency loss. Weighting the benefit against the 
loss, a clearcut conclusion emerges: the market should be left unaltered and all regula- 
tions eliminated. 

This position, however, ignores that regulation in all advanced economies and all 
sectors has always appeared as the outcome of an intense economical, political and social 
conflict between opposed factions. 

Contracting The intertwining of private and public interests in natural monopolies 
can be best understood from the perspective of transaction costs (cf. §13.3.3). Notice 
indeed that establishing a network service in a purely private fashion is a daunting task 
as it requires transacting with many individual land owners under the threat of halting 
the entire project if a single negotiation fails. 8@ It is thus an economizing solution for 
a firm to seek a right-of-way from authorities (cf. §16.1.2); this is the path followed in 
all sectors at all times in all countries. From this moment on, a bilateral relationship 
is established between a seller, the firm building and operating the network and the 
buyer, a public body aggregating the interest of the numerous future clients, households 
and businesses. In the ensuing bargaining, the regulator trades quality, geographical 
extension and prices against stability and protection from entry to allow the firm to 
recoup her high sunk cost. 9 ® 

What characterizes most clearly this relationship are the specific investments made 
by each party and the subsequent hold-up problem (cf. §13.3.3). Indeed, the firm lays 
down an immobile network with virtually zero value in alternative use, thus fears oppor- 
tunistic price ceilings from the city (or any other costly demand). Likewise, the city, once 
it has permitted the building of the network, can not repossess it if she is dissatisfied 
with the service rendered by the firm. To avoid mutual hold-up, partners have to use 
detailed long-term contracts with revision clauses. The concession model (cf. §16.4.1) 
used in France for several centuries 10 ® has been found by Priest (1993) to characterize 
the early regulation of US utilities. The ensuing evolution into a state regulation with 
public utility commissions (PUC) during the last century 11 ® is attributed to an increase 
in the minimum efficient scale (improved technology) and the need for an independent 
body to stop municipal opportunistic behavior (cf. Knittel (2006)). 



17.1.2 Pricing Quandary 

We now inquire about the socially optimal pricing policy for a network firm or natural 
monopoly. The Pareto conditions for global economic efficiency (cf. §2.3.3) call for margi- 
nal cost pricing everywhere. Yet, applying such a policy to natural monopolies is bound 
to generates operating losses since the average cost remains systematically above the 
marginal cost (i.e., fixed cost are not covered). We look in turn at two opposing and hotly 
debated compromises to solve this quandary (cf. Ruggles (1949)). 

Marginal Cost Pricing 

The archetypal example of marginal cost pricing is found in Dupuit (1844)'s bridge: once 
build, it is inefficient to set a positive passage price since there is no variable cost in- 
volved, in the absence of congestion (cf. §25.3). Thus, we should "let bygones be bygones" 
and make it a free service. The same applies today to digital goods (cf. §12.3). Obvi- 
ously, the service is available today because an investment was made in the past which 
raises the question of its financing and the just remuneration of the risk taken by the 
entrepreneur. 

For a public service, the government can cover the operating losses of a natural mono- 
poly by resorting to public funds. The ideal instrument to raise money without impinging 
on the economy is the lump sum tax, a fixed amount to be paid by every household. The 
problem with the practical implementation of this scheme is that the aggregate level of 
all fixed costs of all regulated activities can sum up to a significant share of the country's 
GDP which means that poor people may not be able to pay their lot. Once we start to 
exempt poor households from the payment, the lump sum tax has become an income tax. 
Thus, the financing scheme has to be a mix of income and commodity taxation which are 
known to be distortionary An inefficiency is thus corrected by the creation of another 
inefficiency 12 ® 

Vickrey (1955) summarizes the problem by introducing the concept of marginal cost 
of public funds, the fact that raising 1€ of subsidy requires withholding more than this 
from tax payers because the administration is a costly redistribution mechanism (e.g., 
wages of bureaucrats) and because taxes distort the economy (cf. §2.3.3). The US gov- 
ernment uses a value of 25% for cost-benefit analysis and recent estimates by Kleven and 
Kreiner (2003) show a range from 10% to 55% for Japan, US, UK, France and Germany 
in increasing order. 



Average Cost Pricing 

The alternative pricing policy, advocated by practitioners since the inception of utilities, 
is that "every tube must stand on its own bottom" i.e., the price of a service should be 
no less than its average economic cost. The fundamental underlying concept here is 
the cost of capital i.e., the return needed by a potential investor to invest in that class 
of risk. For a private firm, regulated or not, pricing above average cost is a necessary 
condition to guarantee the continuity of supply for otherwise, no new investment would 
take place so that the service would deteriorate and possible come to an halt. Obviously, 
a private (profit maximizing) firm will more than agree with this view since its objective 
goes beyond covering cost as she seeks to obtain an extraordinary profit. For a public 
firm (state owned), computing economic cost using an economy wide cost of capital is a 
way to identify the most useful investments for society. 

When the firm produces a variety of goods and services, the average cost rule boils 
down to find a vector of prices generating enough revenue to cover economic cost (includ- 
ing the remuneration of capital). The thorny question is how to set relative prices as this 
implicitly defines cross-subsidization among the product lines (cf §17.3.4). Likewise, 
when all natural monopolies are pooled inside a State owned conglomerate such as INI 
in Spain or IRI in Italy, there is only one budget constraint for the whole public sector 
but the permanent surpluses of some activities such as petroleum are bound to compen- 
sate for the chronic deficits of others such as postal services. Socializing fixed costs over 
the whole economy makes it impossible to know who pays for what and check whether 
the general redistribution objectives of the government are met. Furthermore, this con- 
flation of costs is opposed by almost everyone since there is always a service we do not 
consume but which we end up financing. 13@ In some sense, the self financing of each reg- 
ulated activity through an adequate price is a transparent and fair method which has 
gained much political support. The only remaining problem, discussed at the end of the 
section, is the allocation of common cost. 

Historically, it must be noted that average cost pricing has always prevailed for regu- 
lated private firms since it is politically unsustainable to channel public money to private 
firms (as required by marginal cost pricing). It has also gradually become the rule for 
state owned enterprises (SOE) since the 1970s when economic downturn started to put 
pressure on public finances. The logical conclusion of the dissociation of utilities from 
the central state is privatization (cf. §16.4.3). We thus let end-users pay the cost of ser- 
vice directly through prices instead of paying for the fixed cost indirectly through the 
State tax system. The privatized firms are nevertheless subjected to some minimum re- 
quirements regarding quality of service, geographical coverage and guaranteed end-user 
prices or any other public service objective (cf. §17.3.4). 



Welfare Horizon 

The marginal vs. average cost controversy originates in the distinction between variable 
and fixed cost or in the division of cost into past and current. Wiseman (1957) judges this 
dichotomy as artificial since for all public services we may think off, when we increase 
consumption by turning on a light or opening a faucet, we do not generate additional cost. 
Indeed, the resources have already been committed and paid hours or days in advance by 
the utility (who take decisions based on the statistical expectation of aggregate demand). 
This means that, like in the Dupuit (1844) bridge example, the service should be given 
for free to satisfy the short-term efficiency conditions. 

At the other extreme of the lens, the fixed cost problem originates in the durable and 
lumpy nature of the assets needed to provide the service e.g., bridge, plant, reservoir. 
Recall now that the Pareto efficiency conditions (cf §2.1.1 & eq. (2.8) or eq. (2.9)) derive 
from an arbitrage argument which presupposes that every factor is perfectly mobile and 
divisible. Applying the welfarist argument to public services and their long term invest- 
ments thus requires us to consider an extremely long time period e.g., 20 years. Once 
we do so, and consider a large country with numerous plants, we can fine tune output by 
planning obsolescence and replacement with the effect that all of cost become variable 
and none is fixed. Over this long term, the marginal cost of service is then the cost of ex- 
panding marginally all assets and inputs to serve an additional client for that duration. 
An example is treated in §25.3 on Peak Load Pricing. 

What these extremes show is that the marginal cost pricing controversy reduces to 
selecting an horizon for welfare maximization. We are thus warranted to give more 
weight to the present (and lower prices) if technological progress reduces the investment 
cost for the future (since current pricing is depressing future income for the utility) and 
if knowledge or some other positive externality is generated by the greater use that low 
pricing today generates. 

Club Principle and Common cost 

To maintain short-run efficiency and still cover fixed cost, we can resort to a two-part 
tariff with a unit price equal to marginal cost and a subscription fee equal to per-capita 
fixed cost or any other formula carrying a redistributive purpose inso far as the total 
collected over customers make up for the entire fixed cost. 

If the fixed cost can be exactly individualized, there is no wealth redistribution and 
everyone pays his fair share which is exactly by how much the system cost is increased 
to serve him. An example would be production of a good plus individualized transport 
by scooter. Observe then that the consumer is buying two products, good and delivery. 



Yet, in may instances, scale economies are achieved by bundling transport into a larger 
carrier that visit several clients. More generally, it is necessary to pool production in 
order to harness positive network externalities. The corresponding common cost must 
then be allocated among clients in a manner furthering the government's social criteria. 
Hence, regulation is required. 

Libertarians offer an alternative avenue free from government intervention. Buchanan 
(1965)'s "club principle" is an equal sharing method that gathers potential clients of a 
service with a view to either produce it themselves 14@ or procure it from an independent 
producer (cf. §13.3.3). According to this principle, citizens also decide on the charac- 
teristics of the service such as quality as well as on its finance scheme. Because people 
voluntarily join the club, its coming into existence will generate an acceptable wealth 
redistribution even if there is price discrimination so that welfare is increased i.e., it is 
a Pareto improvement. 

Several critics can be addressed to this rosy view. Firstly, the club operation involves 
a political decentralization since the central government is in effect agreeing that the 
group of citizens will carry a policy in accordance with its general social criteria. Such a 
conclusion is by no means obvious since the gathering obeys cost minimizing considera- 
tions and geographical constraints without relations to social realities that traditionally 
guide the government. A second problem with the club view is that the utility has the up- 
per hand and therefore does not leave much surplus to the citizen community. 15@ Indeed, 
to take advantage of the underlying scale economies, a large minimum size of operation 
is needed which requires expertise in technology and finance. These in turn are found 
in profit maximizing corporations who end becoming natural monopolies. To avoid an 
abuse of dominant position, authorities have to step in. 16@ To conclude, except for the 
unlikely cases of where all dimensions of the service can be ascribed to the final users 
independently, the pricing of a public service (at least the subscription fee) requires al- 
locating common cost by making interpersonal comparisons which ought to follow the 
general social criteria of the government as outlined in the contractual view. 

17.2 Ideal Regulation of a Natural Monopoly 

We shall study regulation under the proviso that a regulated firm finances her cost out 
of direct revenues. This means that the regulator, also known as the social planner 
in the public economics literature, maximizes welfare under the budget constraints of 
the regulated firm. It is customary to speak of a second best or constrained efficiency 
approach and reserve the label first-best or full efficiency to the hypothetical case where 
the deficits generated by marginal cost pricing can be financed out of lump-sum taxation. 



17.2.1 Homogeneous Demand 

In the simplest case of a regulated firm producing a single good, the solution to the above 
program, already hinted at by Edgeworth (1910), is readily identified on Figure 17.1. The 
first-best which maximizes welfare without any restriction is the quantity q* equating 
WTP and marginal cost; since there is a fixed cost, the average cost is above the marginal 
cost so that selling q* units at price p* generate operating losses equal to the stripped 
area (and also to q* x [AC{q*) - p*)). Since welfare increases with the quantity produced 
(and consumed), the minimal loss is achieved by reducing the quantity from the first- 
best level till the level q where the budget constraint qP{q) > C{q) becomes satisfied; this 
occurs when the price is equal to the average cost (P = AC >C m ). 
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Figure 17.1: Cost Structure for a Natural Monopoly 



Notice however that the first-best quantity q* is only a candidate because one must 
check that the maximum welfare W* , the area between the demand and marginal cost 
curves, covers the fixed cost F. If this is not the case then society is better off closing 
the plant, saving F and renouncing to the surplus W*. To take the correct decision, the 
regulator has to compute the consumer surplus which is quite difficult as it crucially 
depends on the willingness to pay of people who always consume (i.e., whatever the 
price). To understand this, imagine that the demand drawn on Figure 17.1 corresponds 
to a state of high demand (Dh) while a rotation to the left around the efficient point 
[q*,p*) corresponds to a flatter and lower demand curve (D/). Assume that the fixed cost 
stands between the surpluses W* and W* . According to the state of demand, the service 
should or shouldn't be produced. Notice that small price variations will never succeed 
to reveal the full shape of the demand curve while large price variations (governmental 
experimentations) are out of the question as they would almost surely trigger a popu- 



lar insurrection. Vickrey (1948) argues that this uncertainty is a additional reason to 
avoid marginal pricing and use instead the (second best) average cost pricing because 
the later would almost automatically reveal whether the activity should be produced or 
not. Indeed, when demand is low, the average price p rises and demand shrinks thereby 
revealing that the service hardly produces anything worthwhile to society. 

| The (second best) efficient regulation of a single activity is average cost pricing. 

17.2.2 Heterogeneous Demand 

The ideal simultaneous regulation of all network industries and other activities involv- 
ing decreasing average cost is derived in a general equilibrium framework by Boiteux 
(1956) and refined by Dreze (1964). While it builds on the previous simple result, care 
has to be taken of heterogeneity. A regulated firm, like any other firm, usually sells its 
basic good or service to several types of clients, like households and professionals or to 
customers who have fairly different uses of the product. This disjunction brings us back 
to one of the everlasting problems of economics, how should we allocate the fixed cost be- 
tween the various classes of customers? Abstaining for the moment from redistribution 
concerns (i.e., who should pay), we solve this problem on terms of efficiency alone. 

Let us consider a good or service bought by households and businesses. The regulator 
would like to price each segment at marginal cost but must accept to raise the prices in 
order to allow the firm to recover its fixed cost. Raising the price in a segment involves a 
welfare loss and a profit increase. Let us then tune prices so that the profit increases by 
100€in each segment; this goes with a welfare loss of say 130€ in the business segment 
as opposed to 160€ in the household segment. It is then pretty clear that more sacrifice 
should then be ask from the business segment since each 100€ of cost recovery involves a 
waste of only 30€ as compared to a waste of 60€ in the household segment. Practically, we 
can start from prices equal to marginal cost and then perform a series of small sacrifices 
(price increases) in all segments following this equi-marginal principle until the budget 
constraint is satisfied (cf §2.1.1). From this arbitrage reasoning, we deduce 

The efficient pricing rule for welfare maximization of an heterogeneous demand under 
a single profit constraint is to equate the ratio of welfare loss to profit increase in all 
segments. 

We now relate this finding to the theories of perfect competition and monopoly. Let 
qt denote the quantity consumed by segment #i and C m j = |^ the marginal cost of ser- 
vice. The willingness to pay for that good is Piiqt) while the marginal revenue from that 



segment is R m j = pt + qiP' v The welfare loss of a price increase in segment #i is Pi - C m j 
while the corresponding profit gained by the firm is C m j-R m j. Stating that the ratio of 
the former over the latter is constant across segments implies that optimal quantities 
qi,...,qi,...,q n satisfy 

P\ ~ ^m,l Pi ~ ^m,i Pn ~ ^m,n . M 7 1 1 
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where A > is greater, the greater the overall sacrifice (fixed cost). A simple manipulation 
of (17.1) yields the Ramsey-Boiteux formula 17 ® 

a + X)(p i -C mJ ) = -Xq i p' i o % i e i = ^ l (17.2) 

V'—C 

for all i < n using the Lerner index S£i = — — (cf. eq. (3.4)) and the own-price elasticity 

of demand e, = %- L . 

This efficient pricing rule is qualitatively identical to the characterization of imperfect 
discrimination by a monopoly seen in equation (4.1). Although the social planner and the 
monopoly have different objectives, they both charge the more elastic segments (large e) 
near marginal cost while they force the more inelastic ones to bear a high share of the 
fixed costs of the public sector or of the monopoly rent. When the fixed cost rises from 
zero, the A parameter increases above zero and prices start to be distorted away from 
marginal cost. A large fixed cost can be recovered only by giving leeway to the firm to 
raise a large producer surplus i.e., by increasing A. At the limit when the parameter 
is infinite, the ratio -^ tends to one and (17.2) becomes identical to equation (4.1) that 
characterizes profit maximization. The maximum fixed cost that can be recovered is the 
maximum producer surplus which is precisely the monopoly profit as studied in §4.2.2 
(cf. Boccard (2010a)). To conclude: 18 ® 

Welfare maximization of an heterogeneous demand under a single profit constraint 
can range from the perfect competition outcome to the monopoly outcome as the fixed 
cost grows from zero to the maximal sustainable level. 

Notice that optimal prices are larger than marginal cost but not proportional to them, 
rather, the wedge is proportional to the inverse elasticity of demand. This formal anal- 
ysis therefore confirms the historic rule of thumb according to which "prices ought to be 
set according to the value of service (perceived by users)". 

When two-way transfers between regulated firms and the government are feasible, 
public services become an additional avenue for taxation and public funding. If A denotes 
now the cost of public funds, every € that can be taken away from the firm's profit can be 



used to reduce the general taxation burden, thus saving 1 + A € to taxpayers. This means 
that the social value of profits is magnified by a factor 1 + A. The regulator therefore 
maximizes W c {q) + (1 + X){W P {q) - F) so that the efficient pricing rule is again (17.2). We 
can thus interpret the economy wide A that we take as exogenous, as an average of 
the A's endogenously determined in each industry that must stand on its own (budget 
neutrality). Observe that the application of this rule to all regulated industries implies 
that profit making industries end up subsidizing deficit making ones. If the government 
cannot give and take at will, then transfers among industries are impossible to carry out 
yet it remains possible to organize them inside a firm so that a profit making branches 
finances a deficit making one. 

Wealth Effects 

The partial equilibrium approach we follow in most of this book disregards wealth effects 
which is acceptable whenever the amount disbursed by a household or firm to procure 
the good or service is a small fraction of his income. Almost by definition, public services 
are crucial for the poorest families (cf §16.1.1) who may thus face liquidity constraint 
when buying them. For instance, efficiency commands to price at marginal cost and set 
a (per-capita) subscription fee large enough to cover fixed cost. But this may lead some 
poor clients to do without the service altogether (and use an improper substitute). A 
sound pricing policy for regulated firms should therefore trade efficiency against (income) 
redistribution. 

Feldstein (1972) adds a distributional concern in regulated two-part pricing. For nor- 
mal goods, whose demand rises with income, the price should be set above marginal cost. 
This way, richer consumers carry a larger share of the fixed cost. This discriminatory 
outcome, akin to a reduction of income disparity, is a side benefit of a policy focused on 
efficiency under budget neutrality and which ignores distributional concerns. However, 
if the public service is an inferior good, whose demand decreases with income, it should 
be priced below marginal cost, so that the subscription becomes larger (in proportion of 
total expenses). Whereas all public services used at home like energy or telecommuni- 
cations are normal goods, public transportation is probably an inferior good since most 
people prefer to use a car if they can afford it. The pricing policy should therefore be 
markedly different. Most cities correctly offer monthly subscriptions to their subway 
i.e., set the price of a ride at zero which in any case is close to the marginal cost. 

To simplify exposition, the general approach based on utility functions is streamlined 
by assuming that heterogenous consumers characterized by some parameter differ in 
their (constant) marginal utility of income vg. We denote E[] the expectation operator 
associated with the distribution of 8. Individual welfare, a monetary measure, is the 



consumer surplus Wa t g{p) at unit price p minus the fixed cost contribution /. Global 
welfare is the mean welfare, weighted by marginal utility of income i.e., 

W = E[v (W d ,g(p)- f)]=E[vgW di g(p)]-E[vg]{C(D(p))- pD(p)) 

since the fee is adjusted to cover the remaining fixed cost. Given that the marginal 
consumer surplus is the opposite of individual demand (cf eq.(2.18)), the FOC for the 
maximization of W is 

nved e ] , n ^ P-C m E[v g dg] 

- Er - = (C m -p)D -Do e = - 1 (17.3) 

E[v e ] p E[v e ]E[d g ] 

introducing the (negative) elasticity of demand and using the fact that total demand is 
D = E[dg]. The RHS of (17.3) is the covariance between marginal utility of income and 
demand. For a normal good it is negative so that the price mark-up is positive but for 
an inferior good the reverse holds. If income distribution does not matter then vg = 1 and 
(17.3) calls for marginal cost pricing. 

17.2.3 Price vs. Quantities 

Governments often intervene markets because of externalities , whether negative such as 
pollution (e.g., C0 2 , noise) and congestion (cf. §25) or positive such as public services (cf. 
§16.1.1) 19@ and public goods (e.g., justice, police, defense). In all these cases, production 
of the main output generates a by-product, called the externality transmitter, impacting 
society adversely or favorably. For instance, a coal plant generates one ton of CO2 per 
MWh of electricity. 

Analytically, the activity under review is characterized by a cost C{q), a market rev- 
enue 20 *^ R{q) and, by way of the transmitter, an external harm H{q) and/or boon B{q). 
Social value is defined as V = R + B-H so that welfare is W = V - C whereas the firm's 
profit is n = R - C. In the absence of property rights regarding the transmitter, no one 
controls its production or consumption so that the firm produces to maximize profit. The 
optimal production q solves the FOC is R m = C m and is sold at price p such that D(p) = q 
(where D is the market demand for the product). In a world of perfect information, the 
government or regulator knows everything and can compute the welfare maximizing 
level q* (solving V m = C m ) which is greater (resp. lesser) than q in the case of harm (resp. 
boon). 21@ To bring production in line with the efficient level, the regulator can either 
mandate the firm to produce no more (resp. no less) than q* in the case of harm (resp. 
boon). Alternatively, she can mandate a maximum (resp. minimum) price p* = C m {q*) in 
the case of harm (resp. boon). Prices and quantities are then interchangeable perfectly 



efficient policy instruments. 

Weitzman (1974a) shows that such a conclusion must be amended once demand and 
supply uncertainty bear on the regulator. 22 ® Specifically assume that cost and valuation 
are functions C{d,q) and V[f},q). We denote E[.] the expectation operator over the proba- 
bilistic distribution of the independent random variables and f\ (upon which everyone 
agrees). Let W{q) = V (f/, q) - C (0, q) . 

When using the quantity instrument, the regulator can only maximize the expected 
welfare E[W(g)] to arrive at an expected quantity q solving E[V m ] = E[C m ]. It is obviously 
ex-post inefficient since V m {i), q) = C m (0, q) is a zero probability event. The price instru- 
ment works in a more contrived way. After being told to price at p, the firm chooses the 
quantity qe (p) solving p = C m [0,q). Constrained welfare is thus E[v[fj,q- e {p)) -C{d,qg{p))] 
whose maximum p solves the FOC 
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(V m (qe)-C m (qe)) d -§}=0 => e[^(^)^] = pt[%] (17.4) 

The output qeip) will then depend on the state of the world but since it ignores the 
harm information (fj), it also falls short of ex-post efficiency. The criteria to compare the 
instruments is the advantage of prices over quantities A = E[W{qg{p)-W{q)]. 

To permit an analytical derivation, we assume that around the optimal quantity q, 
functions are linear with C m (0,q) = c + Q + 8q and V m [f\,q) = v + fj- fiq where 6 and fj are 
centered variables with variance a 2 and v 2 . We have q - J£=|, qeip) - p ~ c s ~ and -^ -\ 
which is non random. Equation (17.4) simplifies into p - E[V m ] - E v-6 - ^{p-c-0) = 
v-^{p-c)^>p = c + 6q which links the two policies. Observe then that qeip) - q - § so 
that taking expectations, we have E[q g {p)] = q and Vlqeip)] = ??• Since W{x) = {v-c)x + 
{f}-0)x- ^j-x 2 , 

A = {v-c){E[qe{p)]-q) + E[(fj-6){q 9 (p)- q)] - ^E[q e (p) 2 - q 2 } 
±E [(ij-0)0] - ^-Vlqeip)] 
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Price is the warranted control instrument when the curvature of the cost function (6) 
is greater than that of the value function (/3) i.e., uncertainty regarding cost generates 
more variability than uncertainty regarding valuation. Contrarily, quantity regulation 
is socially superior when the valuation can vary widely with uncertainty as with natural 
disasters and health outbreaks. 

If instead of interacting with the producer to take advantage of his cost minimizing 
behavior, the regulator interacts with the consumer (to take advantage of his value maxi- 



mizing behavior), all the computations go along only that labels must be exchanged, thus 

2 

the advantage of price (told to the consumer) over quantity is ^zifi- 8) so that quantity 
is either dominated by price told to firm or told to consumer (but dominates the other 
one). 

If instead of a monopoly, an entire industry is regulated the case for prices is rein- 
forced because quantity schemes involves inefficiencies across members of the industry. 
Specifically, when the technologies are similar in curvature (same 8) but still suffer dif- 
ferent shocks, the (5 factor is multiplied by + n ~ p < 1 where p £ [0; 1] is the correlation co- 
efficient between firms' shocks (i.e., covariance is atj - pv 2 ). A larger number of industry 
participants and more independent cost between them (lower p) make prices relatively 
more attractive to the regulator. 

17.2.4 Public (non-market) Provision 

As explained in §13.1.3 on bureaucracy, the provision of many public services is provided 
in-house by a bureau or agency. Niskanen (1968) proposes a formal model of bureau 
behavior based on two premices. Firstly bureaucrats, motivated by the ideal of public 
service, prefer more to less budget 23 @ in order to provide the public with more output. 
Next, a bureau negotiates with the finance ministry a bundle of service objectives against 
a global budget (able to cover for the cost of the aforementioned objective). 

Within such a framework, the bureau is like a discriminating monopolist making a 
"take-it-or-leave-it" offer to society; he will thus be able to exact an amount equal to 
society' total willingness to pay (TWTP) for the stated objective. Since the latter sums 
the social WTP for all units, the bureau's ideal offer is the output at which the social WTP 
is nil, an amount considerably larger than what efficiency warrants since it completely 
ignores cost. Yet, the prohibition of running deficit may force the bureau to back down a 
little. 

Indeed, in our linear framework P{p) = ^- and C m {q) = c + 8q, the efficient output 
solving P = C m is q* = f^ff . The budget limit is TWTP V(q) = tfP(x)dx = aq-\q 2 while 
total cost is C{q) = cq+ \8q 2 , hence the financing condition V > C boils down to q < 2q* 
which means that the bureau will tend to operate at twice the adequate level. 24@ 

Assuming that all the budget is spend to increase output (or quality) of the public 
service is quite strong. Obviously, the bureau has enough leeway to spend money into 
items and issues that bring satisfaction only to the bureau members, thus being socially 
unwarranted. We should thus distinguish within the budget between appropriation and 
production. It remains nevertheless true that production will not be lesser than q* since 
a marginal increase brings about utility, thus budget at a rate equal to WTP which is 



greater than marginal cost, hence also increase the free cash flow. As output rise from 
zero to the maximum 2q* that can be financed, cash flow (profit) rises to a maximum at 
q* and then decreases back to zero at 2q*. 

Hence, a pure profiteer at the head of the bureau will produce q* in order to enjoy 
the maximum cash flow; in this case, a careful audit of cost would reveal some "fat" in 
the budget (wasted resources). Yet, allocative efficiency is achieved since the socially 
adequate level of output is chosen. There is however a distributive inefficiency inso 
far as the welfare (difference between TWTP and cost) appropriated by the bureaucrats 
requires distortionary taxation to be incorporated in the budget and is then spend in 
unproductively 25@ 

At the opposite, a pure civil servant will choose 2q* and leave no "fat" since his bud- 
get just covers his total costs. 26 ® Most likely, the output decision will be intermediate 
between those extremes reflecting the mixed motives of the bureau managers. In any 
case, suppliers to the bureau (selling input at total price C{q)) share a common interest 
with the bureaucrats in expanding the budget. 

It is obvious that this formalization of the bureau applies to any organization such 
a division within a firm or a school or an hospital which does not sell its output at unit 
price in a market but negotiate with a higher authority an annual objective against a 
budget. Take-over mechanisms (or competitive entry) being more readily available in 
the private sector, there should be less discretion in the use of budgets. 

Williamson (1963) shows that managerial discretion has a non trivial impact on re- 
source allocation within the firm when a manager cares for staff size and free cash flow 
(FCF) spend over pet projects. Given revenue R, cost C, staff S, tax rate t and the mini- 
mum profit 7r required by shareholders, free cash flow is it = (1 - t)(R- C-S) - n . If the 
manager utility is u[F,n) then the FOCs for maximization are R m = C m and || = 1 - jr^. 
The former corresponds to standard behavior while the latter shows excessive staffing 
since the marginal value of staff in terms of added revenue is equated with less than its 
unitary cost, due to the MRS between CFC and staff expressed by the manager. If the 
tax rate increases then output and staff are likely to increase. 

17.3 Practical Regulation 

The previous section was theoretical and emphasized the ideal price structure assuming 
implicitly that average cost pricing was easy to implement and would provide the firm 
with adequate revenues. In this section, we delve into the more concrete problem of 
revenue structure raising up a number of practical difficulties. 



17.3.1 Cost Based methods 

The regulatory framework emphasizing cost aspects is diversely known as "Rate of Re- 
turn Regulation" ,"Cost of Service" or "Cost Plus" (C+). The label "Cost Plus" is used 
for public services such as the local police or the building of an hospital (public pro- 
curement) while the label "Cost of Service" is used for private services like distribution 
utilities (water, gas, electricity) or the trash collection. 

Fully Distributed Cost 

The first step in the C+ regulation is to allocate all cost among activities of the firm 
whether some or all are regulated. Take the example of local (#1) and long distance (#2) 
telephone. Total cost can be decomposed as C = F + c\q\ + F\ + 02^2 + ^2 where a and Ft are 
the marginal and fixed cost of activity #i while Fo is the (fixed) cost of production shared 
by the two activities. This encompasses the administrative cost but above all the cost of 
building, maintaining and operating facilities used by the two activities. 

The most disputable decision to be taken by the regulator is setting the share Aj of the 
join cost Fq that falls on activity #i. The revenue requirement of this business line, given 
the estimated demand qt is then Q = aqt + Ft + A ; Fo and the final stage of the regulatory 
process is simply to tune the price of service #i so as to ensure that revenue meets cost 
i.e., to price at average cost. For a non regulated activity #/, the firm sets her price 
to maximizes profits taking into account that the fixed cost A,jF will not be covered by 
regulated revenues. 

Among the distribution methods, the attributable cost sharing sets A; = C[ q% ' 27@ and 
the output sharing sets A ( = ^ . However, the political will to support an activity or a 
group leads to distort the objective schemes presented above. For instance, urban areas 
frequently subsidy rural ones in networks. One can then speak of politically or ethically 
motivated cross-subsidization. 

Fair Rate of Return 

In the previous discussion, fixed cost are economic cost, not accounting ones, thus they 
incorporate a remuneration of the capital invested. In the C+ framework, the regula- 
tor delineates the regulatory asset base (RAB) consisting of all the assets necessary to 
perform the regulated activity and sets a fair rate of return p to be earn on the RAB. 
The notion of "fairness" has been hotly debated in the US but stabilized after a 1944 
supreme court judgment stating that "the return to the equity owner should be commen- 
surate with returns on investments in other enterprises having corresponding risks"; 
this is what is usually called the cost of capital. 



Practically speaking, the regulatory process starts with lengthy review of all invest- 
ments in order to update the RAB composition and its valuation K. The maximum al- 
lowed profit for the regulated firm is then set to be pK. If the revenues net of operating 
cost fall short of this benchmark, the regulator must increase the prices otherwise prices 
are cut to reduce this excessive rent. We now proceed to analyze the behavioral conse- 
quences of the C+ regulation. 

Gold Plating or Padding 

Since the inception of C+ regulation, it has been argued that firms try to inflate their 
RAB to earn higher profits since the multiplier, the fair rate of return, is quite constant. 
This strategy can take various forms such as buying unnecessary material or designs (as 
if gold plating of copper wires was necessary for telephone service), increase capacity of 
service excessively (on grounds of security, reliability or quality) or enter non profitable 
markets (often unregulated). The later, more modern strategy, also enables the firm to 
preempt markets that may become profitable in the future. 

In practice, the harm is limited because the firm's costs statement are scrutinized 
by regulatory commissions with the threat that imprudent investments or unnecessary 
expenses be left out of the RAB. The bureaucratic review process which is itself costly 
for the economy has to a large degree been successful in preventing grossly excessive 
spending by regulated companies. Another positive feature, though not intended by 
regulators, is that during the period separating two rate reviews, prices remain constant, 
thus incentives to cut cost are greatest (cf "fixed price" regulation later on). 

Perverse incentives of C+ regulation are nevertheless present. Wellisz (1963) shows 
that a utility confronted to alternating peak and off peak demand (cf. §25.3) has a clear 
incentive to lower peak prices or at least to oppose the more efficient but higher peak 
load prices. If successful, this strategy boosts the peak demand beyond the efficient level 
with the consequence that to maintain service the firm has to increase capacity. In the 
process, the firm increases her RAB and thus her profits. 

Averch-Johnson Effect 

In the first mathematical model of firm behavior in a regulatory framework, Averch and 
Johnson (1962) show that the rate of return regulation reduces the perceived price of 
capital so that the firm is lead to use a relatively excessive amount of capital in compar- 
ison to other inputs such a labor; it might even be the case that the workforce is reduced 
through "downsizing". We study two flavors of this model, firstly under a strict C+ reg- 
ulation where the price is set by the regulator and then when the firm is free to set the 



price. 

Consider a regulated monopoly whose production function is 0(X,L). The market 

willingness to pay is denoted P{q) and R{q) = qP{q) is the revenue function; profit is 

U{K,L) = R{Q>{K,L)) - wL-rK where r is the cost of capital and w the wage rate. The cap 

imposed by the regulator on the rate of return (ror) on capital (asset base) is denoted p; 

it leads to the constraint 

RmK,L))-(oL n 
P = = K + r ~ p ( 7 ' 5) 

The ror at the unconstrained monopoly optimum is denoted p*. A meaningful fair 
rate p should satisfy 

r<p<p* (17.6) 

for if the limit is set below the cost of capital r, no investor will remain in that activity 
and this will put the service at risk. 28@ Likewise if the limit is set above the unrestrained 
rate of return, the firm will happily continues to behave as a monopoly under a useless 
regulation. 29 ® 

Under a strict C+ regulation, the regulator sets the price p which in turn determines a 
demand q = D{p) but also the total revenue of the firm; profit maximization then amounts 
to cost minimization. The only task of the firm is to choose inputs K and L so as to meet 
demand i.e., <${K,L) = q and its optimal choice is thus the cost minimizing pair {K* ,£,*). If 
the resulting ror p* is greater than p, then the firm must find a way to meet constraint 
(17.5) with equality without losing too much profits. Equality in (17.5) yields p = ^^. 
We now use the fact that 0(iC, L) = q (inputs meet demand) to deduce that L is a decreasing 
implicit function of K. It is now obvious that to reduce p one cannot do otherwise than 
increasing K until p = p. 30 ® 

I The capital to labor ratio chosen under rate of return regulation is excessive with 
respect to the efficient one (which minimizes production cost). 

In the original setting of Averch and Johnson (1962), the firm is less tightly regulated 
since it is free to set the price of the regulated service (or equivalently the level of pro- 
duction). The previous result continues to hold: for a given production and associated 
revenue, the firm over-invests in capital (with respect to the cost minimizing mix). What 
is then the optimal output for a ror regulated monopoly? Starting from the uncondi- 
tional profit maximizer which violates the ror constraint, a capital increment loosens 
the constraint as can be seen from (17.5) and increases production, thus decreases prof- 
its which further loosens the constraint; we have thus shown that the optimal production 
under ror regulation is larger than without regulation i.e., q> q* . We can then state the 
original conclusion of Averch and Johnson (1962) 



A profit-maximizing firm regulated by a cap on its ror increases production without 
minimizing cost and by overinvesting in capital with respect to other inputs such as 
labor. 

The previous reasoning can be illustrated with the help of Figure 17.2. The expansion 
path of cost minimizing inputs solves r/O^ = <y /<&£(= C m ); it is an increasing curve in 
the {K,L) plane. The ror constraint p defines an "eye-drop" shaped curve such that 
inside points are prohibited. 31@ Since p* > p, the unconstrained optimum is a point on 
the expansion path strictly inside the "eye-drop" (it uses capital K*). The constrained 
optimum is a point on the "eye-drop" that maximizes profits. Since (17.5) holds with 
equality, profit is simply (p - r)K, hence the maximum while staying on the "eye-drop" is 
reached at the right most point K. 
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Figure 17.2: Rate of Return Regulation 

Lastly, we assess whether it is welfare enhancing to set the cap close to the cost of 
capital (recall that it cannot be lower if one wants to maintain service). Strengthening 
the regulation by lowering the cap, amounts to inflate the "eye-drop". The new optimum 
is displayed as a diamond on Figure 17.2: on the one hand production is increased which 
is positive but on the other hand, the input mix is pushed further away from the cost 
minimizing expansion path which is negative. To disentangle this dilemma recall that 
the second best solves P = AC and that for a natural monopoly we have AC > C m every- 
where (scale economies). Setting p nearby r forces the firm to produce an output that 
nearly equates P and AC, the distorted average cost. As it very unlikely that the distor- 
tion of input mix invalidates the inequality AC > C m , we would still have P>C m which is 
the sign that the additional production so obtained was welfare enhancing. 

Strict equality of p with the cost of capital is not advisable. As we show in §19.3, 
whenever there is some slight uncertainty regarding future demand or cost, it is neces- 



sary to leave a minimal rent to the firm in order to guarantee her presence in the market. 
We can therefore conclude 

I The socially optimal ror regulation of a natural monopoly should be set nearby the 
cost of capital. 

Differing Objective 

As shown by Bailey and Malone (1970), the ror regulation yields an opposite result 
if the objective of the firm is either revenue or output maximization which is a quite 
plausible objective for public utilities having local elected representatives on their board 
of directors (this issue is discussed in §15.1). 

To catch the intuition, recall first that the ror constraint is R{q) < wL + pK. To maxi- 
mize revenue R{q), the manager can start from very large amounts of inputs whose cost 
is large; this enables a large production but a small revenue since the price has to be 
adjusted downward to sell such a quantity. There is thus considerable slack in the ror 
constraint. Consider now reducing capital K, the revenue increases at speed Jjj? = <£>KRm 
while the cost element in the constraint reduces at speed p; the slack in the constraint is 
reduced at rate 0^fl m -p. A reduction of labor L reduces the slack at rate <E>i.R m - w. When 
the constraint is violated at q, the reductions in capital and labor are tuned so that all the 
available slack is used up; at the optimum, it must be true that 0^i? m - p = = ®z,i? m - w. 
A simple algebraic manipulation yields ^ = £ > ~ as if capital was more expensive for 
that firm; it is then obvious that the optimal mix involves under-capitalization (with 
respect to the cost minimizing way to produce the optimal quantity). 

A revenue-maximizing firm regulated by ror restricts output (with respect to her ideal 
choice) but underinvests in capital and oversizes its workforce (with respect to the cost 
minimizing choices). 

17.3.2 Incentive Regulation 

Beyond the issues previously debated, the C+ regulation's main defect is that the firm 
has no incentive whatsoever to reduce its cost structure, improve quality or to take 
risk at introducing new technologies. The idea of incentive regulation is to thwart the 
"business-as-usal" behavior of managers of firm acting under a C+ regulation. The basic 
and theoretically most powerful scheme is the Fixed Price (FP) aka Price Cap regula- 
tion. For a public service such as the building of a public library, the regulator sets a 
price equal to the average expected cost of service. For a private service such as trash 
collection, he computes the fair price to be charged to end users. 



Under FP, the firm is the residual claimant of any reduction of its production costs; 
this gives her the maximum incentive but also forces her to bear maximum risk. A 
frequently studied example is the US Medicare program that repays the hospital a fixed 
amount for each patient according to his diagnostic group. 

Sliding Scale 

More than a century ago, English gas utilities were regulated by a sliding scale; starting 
from an adequate initial price and profit, the firm is allowed to increase profits if she 
lowers prices so that consumers also benefit the change. For instance, if a new production 
technique allows a 2€unit cost reduction, then the price has to drop by l€so that the 
unit margin increases also by 1€. The sliding scale is in fact a combination of C+ and FP 
whereby the firm is allowed a price increase if her cost rises but at a less than one-to-one 
rate. C+ occurs when the rate is unity (price tracks exactly cost) while under FP it is 
zero (price ignores cost). A sliding scale thus provides incentives to cost reduction and 
risk-sharing at the same time. 

Revenue Cap 

some energy utilities used to be regulated trough a global revenue cap in order to allow 
some flexibility in the setting of prices across market segments. A defect of this regu- 
lation identified by Crew and Kleindorfer (1996) is the tendency for firms to overprice 
their services. In the traditional price-quantity space, a revenue cap corresponds to an 
iso-revenue hyperbola below which the firm can choose her price-quantity pair as shown 
on Figure 17.3. The highest conceivable is the one passing through the unregulated 
monopoly optimum {q M ,p M ). The regulation will make sense only if the cap is stricter 
than q M x p M . In that case the new limiting revenue hyperbola will cut the demand at 
two prices above and below p M . 



iso-revenue curves 




Figure 17.3: Revenue Cap Regulation 



Since the pair has to lie on the demand curve, there are in fact only two possibles 
prices that lie around the unconstrained monopoly price. The firm will elicit the larger 
price p A because it yield less sales, thus less cost but the same revenue which ultimately 
means more profits than the lower price p B . 

Price Cap 

The now most frequently used form of incentive regulation is the price cap, proposed orig- 
inally by Littlechild (1983) for the regulation of UK's former telecommunication mono- 
poly at the time of its privatization. This framework amounts to fix the maximum output 
price that can be charged to end-users, thereby generating a minimal consumer surplus. 
Since the firm's profit is an increasing function of price from zero to the monopoly's op- 
timum, the firm will price at the cap whenever it is set below the monopoly price by the 
regulator. 32 ® 

Obviously, the lower the cap, the greater the production and the consumer surplus 
but the lower the firm's surplus. This might become problematic because if the firm 
cannot cover her fixed cost out of this left-over, shareholder are getting a below average 
return on equity and may therefore force the firm to exit the market. To avoid this 
dreadful eventuality, the regulator is forced to set the cap strictly above the expected 
average cost of the firm so that the efficiency gains from setting a maximum price lower 
than the unconstrained monopoly choice are somehow wasted away. In that respect, FP 
fares worse than C+ which is an ex-post regulation able to compensate the firm for any 
unexpected event that lowered profits. 

The positive side of FP regulation is its ability to maintain the firm on the track of 
technological progress. Indeed, the price cap leaves the firm free to pocket the full benefit 
of any cost reduction she might succeed to generate; it therefore fosters the adoption of 
innovative technologies whereas, in this respect, the C+ framework leaves the firm indif- 
ferent. Obviously, the downgrading of quality is an easy way to achieve cost reductions 
which is why regulators explicitly include quality objectives in the service contracts of 
utilities. We may therefore conclude 

FP regulation fares worse than C+ with respect to allocative efficiency but better with 
respect to dynamic efficiency understood as the ability to incorporate technical inno- 
vations. 

Since the economy is constantly evolving, the price cap has to be adjusted periodically. 
Firstly, one accounts for the general increase of input prices measured imperfectly by the 
consumer price index (CPI) (retail price index (RPI) in the UK). Secondly, the increase 
of demand and the technological progress in the economy or within the sector have to be 



integrated in the formula; this leads to the famous "CPI - x" or "RPI - x" formula stating 
that the price in the next review will be increased by CPI% minus x%, where x measures 
the savings due to the exogenous rate of technical progress and the increase of demand 
that enables to take advantage of scale economies. 

Yardstick Competition 

The price observed in a competitive market is the ideal yardstick to measure the level 
of unit cost in the industry since individual output realizes the equality of price and 
marginal cost. This public information is quite useful for a firm owner as it enables him 
to check whether her manager runs properly the business. Whenever a firm is shielded 
from competition as in the case of a regulated natural monopoly, one loses this reference. 
Mead (1944) proposes to compare the performance of the regulated firm to that of all the 
other firms producing the same good or service in similar conditions; this way one builds 
a yardstick that can be used to discipline the manager. Nowadays, regulators performs 
statistical productivity analysis upon samples of comparable firms to determine indexes 
of efficiency that later serves to set the x factor in CPI - x price cap regulations; for 
instance one can set it to half the distance between the worse and the best firm in the 
sample. Shleifer (1985) calls this form of regulation yardstick competition and shows it 
can induce all firms to behave efficiently in terms of cost reducing innovations. 

Comparison of FP and C+ 

Table 17.1 summarizes the main features of the Rate of Return and Price Cap regulatory 
frameworks. 



Dimensions 


Price Cap 


Rate of Return 


1 - Prices flexibility 


Yes 


No 


2 - Regulatory lag 


Long 


Short 


3 - Sensitivity to realized costs 


Low 


High 


4 - Regulatory discretion 


Yes 


No 



Table 17.1: Regulatory Regimes 



Some comments are: 

A Price flexibility: if there were no asymmetry of information there would be no point 
in letting the firm set (potentially harmful) prices. Otherwise flexibility may allow 
the firm to employ its superior information to design prices that generate higher 
levels of welfare. The usual forms are average revenue regulation and tariff basket 
regulation; in the later, the firm can chooses the prices it sees fit as long as total 



revenue does not increase with respect to the previous period. The problem is that 
the firm may cross subsidy a segment to attack a competitor and may also cease to 
cross subsidy the poor. A partial remedy for the latter is to include a public service 
obligation. 

B A regulatory policy may be unable to secure substantial surplus for consumers 
when it is first implemented, but repeated application of the policy may serve con- 
sumers well. Suppose that the X factor in a CPI-X regulation is updated period- 
ically to eliminate the firm's expected future rents and is based upon the current 
realized revenues and costs. Even though the firm can retain all yearly profits, 
it recognizes that larger present profits-generated for instance by efficiency gains 
may result in smaller future earnings. Consequently, the incentive to reduce oper- 
ating costs is weakened (cf ratchet effect §17.3.3). Frequent revisions are optimal 
if costs are exogenously driven and demand is elastic because the deadweight loss 
matters. Otherwise reviews should be infrequent to motivate the firm to reduce 
costs. Fixing the X factor is the most difficult and controversial part of the regula- 
tory process (firms often sue their regulator delaying and weakening the regulation 
process). 

C Although the X factor is supposed to track possible cost reduction due to techno- 
logical progress, the regulator can renounce to use cost information because it is 
difficult to decipher from the firm accounting statements. Another reason comes 
from the fact that lump-sum transfers to the regulated firm are prohibited by the 
EC so that the price cap is the only instrument available to achieve two objectives: 
provide incentives to reduce costs and make sure that prices follow realized cost in 
order to promote allocative efficiency (deadweight loss reduction). A compromise is 
thus required. 

D Lastly we consider how much policy discretion to afford the regulator himself. If 
granted flexibility there is the risk that he acts opportunistically (ratchet effect); 
he won't be able to refrain from maximizing welfare ex-post (setting a large X) 
and this will badly distort the ex-ante incentives of the firm toward cost reducing 
investments. Alternatively, the regulator might succumb to industry pressure. 

Given the practical difficulties for the regulator to characterize the missing informa- 
tion, the relevant constraints, the correct objective and the most reliable instruments, 
he is often lead to use simple regulatory rules. The ror regulation is interventionist in 
that all prices are set by the regulator and constantly revised using cost statements to 
ensure closeness to the ror objective which at the same time is guaranteed to the regu- 



lated firm. Under a price cap regulation the firm is free to discriminate among product 
lines or clients and for a longer time before revision. This is so because the regulation 
ceases to monitor cost. Nonetheless the regulatory regime being more flexible it is also 
susceptible of important changes when revision comes. 

17.3.3 Regulatory Constraints 

The previous analysis completely ignored the constraints on information, contracts and 
policy that the regulator faces whenever trying to implement a regulation. These issues 
are related to the agency theory developed in part H and to firm theory that deals with 
authority relationships inside firms (cf §13). 

Transaction Costs 

The first constraint faced by a regulator is related to transaction costs (cf. §13.3.3), 
the difficulty to write and enforce contracts, between himself and the firm. Incumbents 
of regulated services or challengers willing to replace them often make promises that 
look attractive for the public but which are difficult to monitor and enforce. Regarding 
enforcement the regulator has limited powers. If the firm fails to deliver on promises, the 
regulator can hardly terminate the contract on grounds of "failure to meet contractual 
performances" because this is an implicit confession of the regulator's own failure to 
select an able candidate. Furthermore, the lag of time necessary to conduct another 
selection process is likely to result in deteriorated service which is very costly in political 
terms. 

In the same line, the effectiveness of regulation is limited by the law establishing the 
regulatory agency, by the laws of higher order that limit the available instruments, by 
government meddling (for electoral purposes) and by the integrity of the regulator itself. 
Regarding the last issue, lobby groups representing consumers, but above all, individ- 
ual firms and their professional associations often step in the regulatory process and try 
to influence the regulator; when successful one speaks of "regulatory capture", a topic 
we already studied in §16.3. A remedy is to isolate the regulator from all interferences 
through the original legislative act but it comes at the cost of inflexibility. A rigidity 
often ascribed to regulators is their obsession with stable prices and their dislike of dis- 
criminatory pricing and Ramsey-Boiteux formulas. It can be understood as a guarantee 
for both producers and consumers that prices will evolve smoothly. Such an insurance 
fosters investments by both sides into the network and into devices to use the network 
service, respectively. 



Hidden Information 

The ideal (second best) regulation for a public service is a per unit price (close to marginal 
cost) coupled with a subscription (covering fixed cost). The correct values are not easily 
computed because the regulator has to use the cost statements handed over by the firm 
who might therefore have an incentive to overstate her true cost. This possibility is 
a real issue given her better knowledge of production technology and demand. In the 
absence of a costly monitoring, the manager of a private firm is likely to behave as a 
pure monopolist simply claiming he faces tough conditions to justify the high prices that 
maximize his profits. Placed in a similar situation, the manager of a public firm would 
pursue personal objectives at odd with public interest. 

The distortion brought about by this asymmetry of information is studied in §21.2.3. 
There is a traditional distinction between variables that are endogenous like the inten- 
sity of effort or the favoring of one activity over others and those variables that are 
exogenous like the knowledge of underlying cost or future demand. In the first case, one 
speaks of hidden action or moral hazard (cf §20) while the second case is referred to 
as hidden knowledge or adverse selection (cf. §21). In a one-shot (not repeated) rela- 
tionship, it is possible for the regulator to design an incentive contract that extracts the 
precious information initially possessed by the firm regarding costs or demand. Yet, as 
public funds (cf. §17.1.2) involve a loss of efficiency, the optimal regulatory policy will be 
distorted as if there was an additional marginal cost of eliciting the private information 
of the firm. 

Ratchet Effect and Commitment 

Yet, if the firm reveals her information too quickly, the regulator will start the next 

regulatory round in a much better position and will squeeze her future informational 

rent by asking for higher effort or lower costs. This phenomenon, due to the repeated 

nature of the interaction, is called the ratchet effect. The term was coined by Berliner 

(1957) who observed that, for political reasons, production objectives in the soviet plan 

were always to be set at higher and higher levels ignoring the existence of economic 

cycles or exogenous constraints. 

It was as if the russian economy was ^ w 

a toothed wheel engaged with a lever ^^^^V c^^^^j ^f"^"^^> 

forcing it to turn in only one direction, 

growth as displayed here. 

This mechanical description is that of a ratchet which explains the analogy. Hence, 
in a repeated contracting environment, the regulated firm will not react to incentive 





schemes as we might hope because she needs to conceal her private information. This 
problem occurs even if the regulator adopts a low powered incentive scheme because he 
cannot commit not to change it the day he will discover the true characteristic of the firm. 
Lengthening the duration of contracts would seem attractive but is often an impossible 
commitment given political turnover. One solution is to pass a law of constitutional 
nature (hard to repel) creating a regulatory commission free from political influence and 
with clear objectives such as the European Central Bank or the US federal commissions. 
The major drawback is then the rigidity of the regulator who, by the very nature of its 
charter, cannot choose flexible policies adapted to the current economic environment. 

The root of the problem with the ratchet effect is the inability to commit for future be- 
havior. A similar issue is at work with governments (democratic or not) who can always 
force a renegotiation after elections on the ground that they have a popular mandate for 
it. Especially, when the government control the judiciary, it does act opportunistically 
against private contractors and concessionaires, especially when their investments are 
immobile such as with infrastructures (e.g., network, airport). 

As an application, consider the issue of reducing carbon emissions. An industry may 
fear that its early curbing efforts may be built (ratcheted) into future, differentiated, re- 
quirements, that will be disadvantageous. Consistently with the incentives of the ratchet 
effect, the industry may minimize those early efforts, to the cost of the national program. 
This effect may be a further argument for economic measures, such as transferable per- 
mits. Permits involve a greater degree of transparency and hence commitment by the 
national government and the international community. They do not affect individual 
industries in a differential way after the revelation of information. 

Stranded Cost 

Regulated firms in energy sectors continuously invest in long lived assets. At the onset 
of a deregulation, large cost are already incurred (sunk) but not yet fully depreciated. 
The recovery of these embedded costs becomes uncertain for regulated incumbents with 
the advent of entry that will likely skim their most profitable clients. Entry is either 
due to deregulation (removal of statutory barriers) or to technological developments that 
lower entry costs (in which case the natural monopoly rationale vanishes). Sidak and 
Baumol (1995) explain the dilemma faced by the regulator. The benefit of withholding 
the implicit contract (aka regulatory compact) with the incumbent is stability and the 
avoidance of "regulatory uncertainty" that investors fear so much; this attitude is pos- 
itive for future investments. The drawback of allowing the recovery of stranded cost is 
to derail deregulation. Indeed, if a surcharge is applied to end-users, prices increase in 
the short term while if the surcharge is applied to inputs (e.g., network access) needed 



by entrants, it forestalls entry. 

A simple theoretical answer to the dilemma would be to judge according to the "pru- 
dent investment" criteria. If the coming of competition was foreseeable, then no recovery 
should be allowed whereas if increased competition was unforeseen, partial recovery is 
optimal to avoid giving firms incentives to make exceedingly risky choices (cf. §23.2.1 on 
free cash flow). However, the situation observed often involved an incoherence, traduc- 
ing a successful regulatory capture. Several years before the actual deregulation would 
materialize by entry of new players, a government would prepare the future law with 
the hope of bringing down the price for end-users either through a reduction of average 
cost or a lessened exercise of market power. At the same time, the government would 
negotiate stranded cost with incumbents. The problem is that if entry or foreseeable 
technological improvements make it almost certain that the price will go down, then all 
the authorized investments under the old regulatory compact reveal themselves to be 
imprudent or undue; indeed, the regulator had the knowledge that either incumbents 
were exercising market power or that new technologies were readily available. 

17.3.4 Public Service Obligation 

A legal introduction to this topic is Sauter (2008) while an economic one is Cremer et al. 
(2001). 

Concept 

Public services, or services of general economic interest (SGEI) in European parlance, 
are a set of products and services deemed essential by the community in order to pro- 
mote social and political objectives, such as equity, participation, cohesion or solidarity. 
The practical goal is to ensure the availability of a minimum set of essential services to 
all users at an affordable price and a minimum quality. For these activities, there is 
the presumption (often based on past experience) that private provision would be inad- 
equate. More precisely, it could be the case that some groups of individual would not be 
served at all or would be offered the service at an exorbitant price either because the cost 
is high or because the provider has market power. 

A Public Service Obligation (PSO) or Universal Service Obligation (USO) in US par- 
lance 3 ^ is then a duty imposed by the State on an industry to supply the SGEI to all 
customers, irrespective of the fact that some, by virtue of their wealth, usage character- 
istics or location, might be more costly to serve then others. For instance, the telephone 
subscription is regulated at country level although the connection to the network is more 
costly for an isolated house in a village than for an apartment in a urban area. Likewise, 



a local call is costlier to carry than a long distance one but is often billed at a lower rate. 
The PSO might then be defined as the difference between the cost of serving those pro- 
tected customers and the revenue they generate (more precisely the forgone revenues 
from discontinuing the service). It may also involve making financial provisions so that 
the service can be made available to users with a disability or in financial distress. 

The original EC treaty defines SGEI vaguely and implicitly leaves the principles and 
conditions of supply of these services to the appreciation of member states. Since a PSO 
involves a compensation, it potentially infringes the State Aid legislation on distortion of 
competition (cf §9.2.4 for some data). 34@ The ECJ has stated in the Altmark judgment 
of 2003 that the PSO must be clearly defined in the national law with objective and 
transparent parameters for compensation. The latter covers cost and a reasonable profit 
and should not confer a competitive advantage. Whenever the service provider has been 
selected through open bidding or a public procurement process, it is assumed by the EC 
that these conditions are satisfied. Member states, however have leeway to define exactly 
what services are "essential" for the collectivity and what is an "affordable" price level. 
Since 1985, directives implementing these concepts have been adopted in the sectors 
of post, mobile telecommunications, airports, ports and maritime transport, insurance, 
broadcasting, energy and railways. 

Milne (1997) refines the goals of USO according to the state of development of the 
underlying market: 

• network establishment: Acquire new technology, set up long-distance links between 
cities, public telephones. 

• wide geographic reach: Reach regional parity, connect all urban areas with 
widespread adoption in business. 

• mass market take-up: Stimulate economy with residential take-up. 

• network completion: Achieve political cohesion: affordable to all and available to 
disabled. 

• services to individuals: Individual right to communicate and public access for edu- 
cation and health institutions. 

Table 17.2: Stages of Universal Access/Service 



Cross-Subsidization 

Since a PSO usually runs a deficit, its provider is authorized to cross-subsidize the costly 
segment by applying market power upon the more profitable segments. When non dis- 
criminatory pricing is further present, this amounts to raise either the unit price or 



the subscription or to raise the price of another regulated service on top of its average 
(economic) cost. 

In a deregulated market where the PSO provider competes with other firms, this 
strategy ceases to be viable because of the cream skimming threat i.e., an entrant can 
capture the profitable segment where the incumbent is currently pricing above average 
cost by undercutting him. The law can therefore establish a fund where all firms con- 
tribute according to their market shares in order to finance the extra cost of the PSO 
provider. Still, the presence of asymmetric information between the firm and the reg- 
ulator make cross-subsidization unavoidable because the PSO provider will artificially 
transfer cost from the unregulated side to the regulated side in order to gain an edge 
over competitors in the deregulated area. 

The modern implementation of a PSO therefore aims at making the subsidies "ex- 
plicit" rather than "implicit" to ensure competitive neutrality i.e., they should be quan- 
tifiable and their distribution clear enough to avoid favoring one competitor or one type 
of technology. Recent implementations have demonstrated that PSOs are not costly, typ- 
ically a few percent of the sales volume of the company in charge; adding on top the 
goodwill amassed from the general public by the provider (he is seen as a benefactor), 
the net cost of a PSO for the economy appears to be much smaller than what debates in 
the 1990s would have lead one to think. For instance, Ofcom computes the PSO cost of 
British Telecom (BT) as one pound per year per capita and nearly zero if one accounts 
for the added goodwill that the PSO bestow upon BT. 

Origin of the PSO 

The first PSO came from the US and is said to originate from regulatory capture. Uni- 
versal service was originally a commercial strategy of extensive geographical coverage 
focussing on major cities imagined by AT&T in the 1880s for its new telephone service 
(the Bell system) in its competition against the older telegraph service of Western Union. 
With the advent of competition on telephony after the expiration of Bell's patents, the 
AT&T motto "a telephone in every home connected to every other telephone in the coun- 
try" was clearly aimed at taking advantage of positive network effects and eliminating 
competitors by refusing interoperability with their system. To avoid costly duplication of 
networks, telephone was granted protection from antitrust laws in 1921 so that mergers 
and technical convergence could take place. Later on, with the federal communication 
commission (FCC) created in 1934 and the Telecommunications Act of 1936, the Bell 
monopoly became regulated but on costs only. Except for the preamble of the law, there 
was no mention of affordable rates, a notion linked to willingness to pay of poor house- 
holds. 



Mueller (1997) claims that universal service and the underlying cross-subsidization 
was made popular by AT&T in its defense against antitrust allegations by claiming that 
universal household telephone penetration never would have existed without monopoly 
and regulatory subsidies. A close examination reveals that rate subsidization started 
in 1965 when more than 70% of residential households had been connected without any 
help from the government. The practice evolved out of the debate over how to properly 
separate costs from long distance that are under federal regulation from those of local 
services which are under state regulation. This author thus holds a "regulatory capture" 
view of universal service and conclude that one could dispense with it. This run opposite 
to the general view in continental Europe where universal service is seen as an atrophied 
version of the traditional public service. 

Conclusion 

At the outset of this section on practical regulation, the reader may get the feeling that 
regulation does not work well. This is entirely correct but it does not constitute a reason 
for its removal for we miss the counterfactual: what prices and business behavior would 
we observe if a public service (without close substitute) was completely deregulated ? 
The answer is still unknown since all governments at local, regional and country levels 
have always regulated those activities even in the UK or US, the staunchest supporters 
of unbridled competition. 

The one exception we may cite is telephone. The decrease in the cost of setting up 
a mobile phone network over the last decades has allowed entry of many players and 
generated an intense competition which in turn has spilled over the landline monopoly. 
In the UK, the regulator Ofcom, after forcing a vertical separation between landlines 
maintenance and retail upon the former monopoly, has succeeded to create a competitive 
market. The only remaining restraint is a PSO whose cost is in line with the benefit it 
bestows upon its holder. 



Chapter 18 



Natural Resources 



Natural Resources, whether renewable like forests and fisheries or exhaustible like oil 
and minerals are commodities of great importance to our material well being. Even 
though advanced economies rely more and more on services (cf Figure 1.1), they have 
not yet managed to dispense with natural resources to build the basic staples of modern 
life (car, plane, TV, PC, iPod, games, ...). Technological progress allows a more efficient 
use of natural resources and thus a smaller per capita demand; but in the aggregate, 
this saving is thwarted by population growth. Because natural resources offer a limited 
supply over the long run, they are deemed strategic by economic actors, private and 
public alike. They are thus worthy of our attention. 

Another interesting feature of natural resources is how they differ from traditional 
private goods such as cars, electricity or pasta. Indeed, nature is directly involved in 
their production whereas the former use only human made factors. This idiosyncrasy 
requires a specific economic analysis taking into account how nature is involved. 

We are ultimately warranted to include natural resources in this text for two reasons. 
The involved industries display either scale economiesor externalities so that we either 
find market power or the need for regulation. EC (2008) reports, pl2, that, out of 4000 
(non-fuel) mining companies with production facilities including mining, smelting and 
refining, the 10 majors make about 83% of the total value whilst the remaining 17% is 
accounted for by about 1000 medium sized and small companies. On the concentration 
front, PWC (2008) reports on the numerous M&A operations in the sector. 

18.1 Exhaustible Resources 

Exhaustibility of crude oil or natural gas is a concern insofar as the depletion of low- 
cost deposits could become soon a reality and then restrict the long-run growth potential 
of the economy. Other exhaustible resources are underground aquifers and the earth's 
capacity of the atmosphere to absorb emissions and waste. Stocks are classified into 



proven, probable and possible reserves as a way to account for increasing costs of extrac- 
tion and the uncertainty surrounding their scale. As a matter of fact, the size of the total 
reserves of most resources is still unknown. 

18.1.1 Identities 

Intertemporal Optimization 

Resources, whether exhaustible like oil or renewable like a fishery, involve stock and 
flow, thus time is of the essence for their management. To solve for the intertemporal 
conflict between present and future use, we follow Weitzman (2003 )'s canonical model of 
intertemporal capital accumulation inspired by Gray (1914) and Ramsey (1928). 1@ 

Consider a decision maker managing a lasting resource called capital, of size k, 
through the savings rate s = 4| = k. 2@ The latter may be investment in physical or human 
capital, biomass natural growth, extraction, catch or retirement. Capital is required to 
remain positive and cannot be destroyed although obsolescence or depreciation tends to 
shrink its stock. 

The objective of the decision maker in the current period is the value function v{k,s). 
Capital is assumed to be worthwhile, thus 4| > but to obtain more capital in the future, 
we must agree to a current sacrifice, hence, 4f < 0. The decision maker's time prefer- 
ences are characterized by the discount rate r (cf §19.1.2 on time preferences). The 
intertemporal objective is thus the sum of discounted values to be received in the future: 
V = / °° v{k t , s t )e~ rt dr. The FOC characterizing the optimal path is the Euler-Lagrange 
equation (derived below): 
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In all our applications, human or natural forces make capital grow at the rate O(fc), net 
of depreciation (natural or technical obsolescence). Investment is then growth net of 
extraction or output i.e., 

s = 0{k)-q (18.2) 

Output q = s - O(fc), in turn, is the main source of value for the agent although it may 
be influenced by the stock of capital, hence we have v = n[k,q). Combining with (18.1) 



and (18.2), we obtain 3 ® 
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Table 18.1 gathers several applications that are commented afterwards. 
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Table 18.1: Nomenclature for growth models 

® An entrepreneur owns a technology with obsolescence factor a generating net revenue 
R{k) (itself arising from some profit maximizing market behavior), thus profit is 
<5(fc) = R{k) - a jfc. 



© In the neoclassical optimal growth model, h q - u d q t x - u"q, hence 



= rj^ where 



r\ = z ^- is the elasticity of marginal utility with respect to consumption and | is 
the growth of consumption along an optimal trajectory. In the Ramsey-Keynes ap- 
proach, philosophical reasons lead to pick r = 0. If we further look at consumption 
and not utility (utilitarianism), then u{q) = q and we fall back on ®. 4@ 

For exhaustible, discoveries are often nil (case = 0) and the FOC involves time be- 
cause the price must grow i.e., p = p t (see later). 

For renewables, the FOC has no time derivative because demand and extraction tech- 
nology are assumed steady. 



Optimal Path 

In the long run, the optimal path tends to a steady state found by dropping the last term 
from (18.3) (cf below for an intuitive characterization). When the objective is linear in 



investment as in problems ®, ©, ®, a most rapid approach (MRA) is optimal because ev- 
ery moment spent in a state where the "steady" FOC is violated yields a lower NPV than 
might otherwise be obtained. One should thus build the asset (or stop extraction) when 
the stock is below the optimum because the asset value is larger than the opportunity 
cost r. On the contrary if the stock is above the optimum, extraction at full capacity or 
non renewal of obsolete equipments is warranted i.e., the asset should be realized (sold) 
rapidly. Insofar as there are no severe limits on the maximum rates of investment and 
divestment, we may directly focus on the optimal steady state (and its "steady" FOC). 
In practical terms, only transaction costs relative to the implementation of the optimal 
steady policy that will slow its advent. 

For the Ramsey optimal growth problem ©, the diminishing returns to consump- 
tion imply that we refuse to give away much consumption now in favor of having more 
later; 5@ because of these adjustment costs, the optimal path approaches the steady state 
in a slower fashion as shown in Weitzman (2003). 

Euler Equation 

Following Sun (2005), we derive intuitively the Euler equation (18.1) using an intertem- 
poral version of the equi-marginal principle seen in §2.1.1 (cf formal proof in appendix). 

At some time t, we consider whether to "consume" a small capital amount e by re- 
ducing investment at rate Y during a fraction of time e/Y. The direct effect is a gain, in 
NPV terms, of /3 = -e~ rj v s e. The indirect effect is due to the permanent stock decrease, 
hence a loss of a = / T °° e~ rt V] c edt. When optimizing the entire investment path, we ought 
to perform such small changes at the moment t where the combined effect a + /3 is max- 
imum. By definition of an optimal path, this effect becomes time-independent and the 
time derivative of a + p must be nil. Since ^f = -e~ Tt v\ c e while -£ - -re~ rT v s e + e~ rT v s e, we 
obtain = ee~ rt {v s - rv s - vt) which simplifies into (18.1). 

The optimal steady stock is also very easy to characterize because the NPV of a steady 
stock simplifies into v (k,0)/r. If we choose to increase A; by £, we suffer an immediate loss 
of v s $ but value, from then on, will be increased by v^. The NPV of this steady incre- 
ment is Vjc^/r. The optimal steady stock thus equates these margins i.e., r = <p{k) = M4 , 
the stationary rate of return on capital (assumed decreasing); it represents the rate at 
which present value (money) can be transformed into perpetual flows of value. 6@ Weitz- 
man (2003) further proves that if v is concave then starting from the unique stationary 
solution, it is optimal to remain forever in the stationary state i.e., there is no path 
(stationary or not) that can improve the NPV. 



18.1.2 Optimal Resource Extraction 

Hotelling (1931) studies optimal resource extraction from three points of view, a com- 
petitive firm, a monopolist, a social planner. Let us denote r the risk-free rate paid by 
financial markets, c the current average extraction cost and c t the anticipated average 
extraction cost for time t. 

Hotelling rule of arbitrage 

Imagine that a single unit can be extracted and sold. If the price is constant (or not 
expected to change in the future), sale is immediate if and only if the cost of extraction 
is lesser than the price. When the price tends to rise with time, extraction takes place at 
a time determined by the following basic financial arbitrage argument: 

• Selling now (at time t) and investing in the financial market generate a cash flow 
(1 + r) [p t - c) in the next period. 

• Postponing extraction for one period yields p t+ \ - c. 

The optimal time for selling is thus when these two options yield identical returns or 
alternatively, when the external interest rate is equal to the internal marginal net return 
on postponing extraction: r = Pf ,! 1 ~ Pf . If, for instance, the price increases at rate a < r, 
then sale occurs at the trigger price p* = -^ > c. 

More generally, the instantaneous profit from extracting an amount q t with technol- 
ogy C(.) and selling (competitively) in the market at price p t is n(q t ) = p t q t -C(q t ). The 
previous arbitrage remains valid for the owners of the firm. At time t, we can extract 
an additional unit to earn the marginal profit n q and invest the proceed in the financial 
market in order to receive the cash flow (1 + r)n q in the next period. If instead, we wait 
for the next period to realize this unit, we earn n q + ii q , where the second term denotes 
the appreciation (or depreciation) of the underlying asset with time. Profit maximiza- 
tion requires the absence of intertemporal arbitrage, thus that the financial return on a 
marginal additional extraction rn q be equal to the capital appreciation n q i.e., 

r = ^- (18.4) 

Given that extraction tends to display decreasing returns to scale around the optimal 
level of activity (n qq < 0), the firm is able to continuously fine tune extraction in order to 
achieve (18.4) which a particular case of (18.3). 

In our particular case, we may solve this equation to find n q = Xe rt where A is the 
shadow price (in present value) of the initial stock of the resource. This is the Hotelling 



rule of extraction: marginal profit grows at the interest rate 

This rule can also be derived using the "manna" trick (cf. §2.1.1): if the resource size 
is increased by one unit, the latter should be extracted at the time yielding the largest 
NPV. In the presence of decreasing returns to scale, the NPVs tend to become equated by 
the application of this rule, hence when extraction is spread over time, an equi-marginal 
intertemporal principle should hold: the marginal profit, in NPV terms, must be constant 
across periods. Now, given an extraction path q t , the latter is n q {q t )e~ rt , hence n q must 
grow at the rate r. 

Solving (18.4) analytically requires a rather crude specification of either the extrac- 
tion cost or the market price process which we take in turn hereafter. 

Competition 

A competitive firm takes the current price p and the future price p t as given. Since a 
unit of sale made today yields an average profit po~co, it can be invested at the risk free 
rate to yield {po-co)e rt at time t; the firm is indifferent between selling an additional 
unit now or later (at time t) if and only if p t -c t = {po - co)e rt . From this basic arbitrage 
argument, we deduce that the most easily accessible resources are exploited first. In- 
deed, their associated average cost is smaller, thus yield a greater current profit which 
commands immediate extraction. 7 ® 

If the resource owner is a small entrepreneur, it makes sense to replace the market 
rate r by the individual's rate of time preference. The Hotelling rule now tells us that 
people who favor present consumption exploit their resources at a faster rate more than 
those who favor the future. 

Efficiency 

The instantaneous welfare is the concave increasing function W{q) = U{q) - C{q) where 
U{q) = Jq P(x)dx and P{.) is the aggregate WTP, deduced from market demand. The dis- 
counted welfare associated to an extraction path q t is thus V = / r W{q t )e~ rot dt. If the 
resource is overabundant, we can sustain forever the traditional efficient level q* solv- 
ing P = C m . If, as is most often the case, the resource comes in a limited supply A;, then 
the consumption path must satisfy Jq q t dt= k where T is the final date of consumption 
i.e., such that qj = 0. The "manna" trick (cf. §2.1.1) tells us that the marginal welfare in 
NPV terms v t = [P{q t ) - C m (q t )) e~ rot must be constant across time. Denoting p - c Q this 
initial value, we obtain p t -c t = {po-co)e rot where p t = P{qt) and c t = C m {q t ). We have thus 
shown that perfect competition is conducive of efficient management of the resource. 
The optimal duration T is determined as follows. From the final condition P(0) - 



C m (0) = p T -c T = (po - c )e r ° T , we derive p -c = (P(0) - C m (0)) e'^ thus p^-clm = e r ° u ~ r) . 
Since p t - c t increases with time and P -C m is decreasing with quantity, the efficient 
consumption diminishes with time. Another proof goes by contradiction: if quantities q 
and q' > q were consumed at times t and t > t, they could be switched thereby generating 
an increase in value because more is consumed sooner. 

This optimal path is parametrized by T. Integrating it over [0; T] yields the total 
extraction as a function of T; equating with the limited resource A;, we derive the optimal 
duration 7\ 8@ 

Monopoly and Oligopoly 

The monopoly maximizes V = f U{q)e~ rot dt under the restriction f q t dt = k. By the 
intertemporal arbitrage argument seen above, U m {q t ) = Ae r ° f for some A > 0. If U m is 
decreasing then the previous study applies: optimal extraction decreases with time and 
the optimal duration T is determined by the method exposed before. 

Discussion 

Most mineral resources are exploited in a similar rapid fashion. The owner is either 
a private firm whose stockholders are foremost interested by short term dividends or 
the public monopoly from a country ruled by a despot. 9@ In both cases, the decision 
maker tends to severely discount the future. If furthermore, he is optimistic, he believes 
that new discoveries will be made and that substitutes will be discovered, so that this 
particular resource will never become scarce. If this was to be true, the price would never 
rise to the choke level (it could even decrease). Under such circumstances, it makes sense 
for the myopic decision maker to extract his resource as fast as possible and invest the 
returns in financial markets or use them for immediate consumption. 10 ® 1 When such a 
short-sighted behavior is generalized, aggregate supply tends to be large so that the 
price of the commodity remains low which rationalizes the original belief (until we hit 
the wall if the discoveries or substitute technologies fail to sprout). 

18.1.3 Examples 

Constant Returns to Scale for Extraction 

If the price is time varying while marginal cost is constant, (18.4) implies that the profit 
of a single extractor satisfies 

Pt -coce rt => ElZl = e rt <> PlZl = e nt-T ] (lg5) 

po - c p-c 



where T is the terminal time where the last remaining unit is sold (infinite if there is no 
choke price p such that D(p) = 0). Letting k denotes the aggregate stock of the resources 
(proven reserves), we can derive T by the intermediate value theorem using the equality 
between total sales and total purchases. 

k = [ q t dt = f D{p t )dt (18.6) 

Jo Jo 

We can solve (18.5-18.6) for the linear demand D{p) = a-bp with associated efficient 
instantaneous consumption q* = a- be. Since WTP is P{q) = ^-, we have p t -c- - ~~ b qt , 
thus (18.5) o 3*z£i = e r[t ~ T) =>q t = q*{i- e r[t - T) ). Plugging into (18.6) yields 

— = (1 - e rit - T] ) dt=T- *—?— (18.7) 

q* Jo y ' r 

where the LHS is the myopic time left to enjoy the constant stream that society would 
extract ignoring the limitedness of the resource. To enable numerical approximation, 

-T-K 



observe that the foresighted duration T* solving (18.7) is larger than T = -£■ + 
-£r+ -, a rough aproximation when rk » q* . 11@ For instance, foresight asks to add another 
20 years when the myopic time is a century and the rate is 5%. Also, the efficient fraction 
to extract at time T is still 63%; it then decreases rapidly until 5% for the last year. 

The monopolist, unexpectedly, can be deemed "the friend of the earth" because he 
would extract the resource at half the ideal speed for a social planner. Indeed, (18.4) 
for the same linear demand uses R m - c = - ~ b - instead of p - c in (18.5). Solving yields 
q t = \q* (l - e r{t - T) ), thus (18.7) becomes || = T- 1 =£^, hence the approximation T m for 
the monopolist's optimal duration T m becomes T m - -Jf + \ which is quite larger than the 
efficient level. 12@ 

Decreasing Returns to Scale for Extraction 

If the price is expected to remain constant at level p but extraction displays DRS with 
C{q) = F + \cq 2 , the FOC (18.4) becomes -cq - r{p - cq) => 7 c q _ = r => ?- q t = Xe rt =^> q t - 
2 - Xe rt . To find the closing conditions, we use k - f T q t dt = ^- - j{e rT - 1). We also have 
qr equal to the minimum efficient scale (when marginal and average cost are equal) 
If there is a last time of extraction T where qr = k T , then the NPV of final extraction 
[pqr - C{qr)) e~ rT must be equal to the NPV Xkj of the remaining stock. Combining with 
the Hotelling rule implies that C(qr)/qT = C {qr) i.e., the last extraction is performed at 
the minimum efficient scale. 

If we now look at the efficient extraction path, we maximize W(q) = f T [U{q t ) - C{q t )) e~ rt d 



under the resource constraint fj q t dt < fco where the marginal utility U m {.) - P{.) is 
the market WTP for the resource. Working out the same arbitrage as before yields 
U m {q t ) - C m {q t ) oc e rt and since there is equilibrium on the market for the extracted re- 
source, we have p t = P{q t ) i.e., the Hotelling rule for private competitive ownership. 

CES demand 

Assume zero production cost and a CES demand D{p) = p~ £ leading to WTP P{q) = q~ lle . 
The previous method does not apply since no finite T is feasible, thus we work with 
T = +00 and we solve p t = poe rot => q t = pQ £ e~ £ ' ot => A; = J +o ° q t dt = |^- => q = p^ £ = er k so 
that at each moment, a fraction % = er e~ £rot of the reserves is extracted. Notice that a 
monopoly would apply the same rule because his marginal revenue is a multiple of P. 

18.1.4 Commons 

The economic exploitation of earth's assets all share a fundamental characteristic, their 
limited size. 13@ Meanwhile these resources were thought to be infinite or much larger 
than aggregate human demand, they were treated as free inputs for the production of 
intermediate goods and thus ignored by economic analysis. 

The modern epoch is however one where the natural limits have started to bind be- 
cause technology enable intensive extraction and population growth fuels demand for 
these valuable assets, nowadays called common pool resources (CPR or commons for 
short). The fact that open access or free entry leads to excessive exploitation (wrt. an 
efficient value maximizing use) has been known in the field of agriculture to classical 
Greek economists ; nowadays, the dramatic label tragedy of the commons captures the 
issue. 14@ 

Environmental and resource economics developed to address the many problems re- 
garding the management of earth's commons. Within the realm of industrial organiza- 
tion, commons are the fruit of human effort with for instance, the current stock of knowl- 
edge, transportation networks (e.g., highway, railway, pipeline) or public buildings (e.g., 
hospital, school, stadium, concert-hall). Solving for their inefficient over-exploitation is 
difficult as the issue bears much similarity with the inherent instability of a cartel (cf 
§9.1). Both situations display a payoff structure similar to the prisoner's dilemma: if 
several firms cooperate not to over exploit the common then all get a large payoff but it 
is always tempting to extract a little more than partners so that in the end everybody 
overuses the common! 



Open Access Equilibrium 

The tragedy of the commons is given an analytical treatment in §7.2.3; the equilibrium is 
tilted away from the efficient management of a monopolist as new firms enter the Com- 
mons. As n /, individual effort, extraction and individual profits diminish but aggregate 
effort and extraction both increase. An obvious consequence for the future (not consid- 
ered in the present model) is a worsening of the extraction technology since the resource 
becomes scarcer (better sources are exploited first). This process last until the entry cost 
starts to bind or when the resource become so crowded that marginal extraction becomes 
negative. 

Privatized vs. Free Access 

Weitzman (1974b) shows that economic agents enjoying free access to a common derive 
a higher income than when the resource is privatized. Whenever the proceeds of pri- 
vatization are unevenly distributed, a frequent occurrence, users of commons end up 
worse, thus politically oppose privatization. This transaction cost may well explain the 
pervasiveness of free access for the management of commons. 

At the root of a resource's over-exploitation is the fact that free entry occurs until 
the average return is equated with cost whereas efficient use commands to equate cost 
with marginal return. This is the dual of monopoly pricing where the monopoly cares for 
marginal revenue whereas society cares for the average. Technically, two conditions are 
required for the tragedy to take place. Firstly, the good or service under consideration 
requires two complementary production factors, capital that comes in fixed supply (in 
the short term) and labour that is elastically supplied according to wage or return. Next, 
the technology must display decreasing returns to scale. 15 ® 

The production technology at resource i = A,B is qi = (p^li) where U is the labour 
input employed at resource i. The market clearing conditions for input and output read 
S[w) = Ia + h and D{p) = q A + q B where labour supply S{w) is increasing while product 
demand D{p) is decreasing. 

Under free access, labour is free to go to any resource, thus the law of one price applies 
and, in equilibrium, there is a common average return w from working in any resource. 
Since labor is an homogeneous production factor, its return is the same for all units, 
thus it is the average return p^-^. Combining these two observations, we obtain the 
equilibrium condition -^ = ^j-^. It then remains to plug this condition into the market 
clearing equations to derive the equilibrium price p, wage w and labour use U for i = A,B. 

In the privatization equilibrium, each resource is managed to maximize profit ni = 
ptyAli) - wli, thus optimal labour contracting at resource i solves -^ = (pAk) < < ^ L r L - (by the 



DRS property). It then remains to plug this condition into the market clearing equations 
to derive the equilibrium price p* , wage w* and labour use /* for i = A,B. Since mar- 
ginal cost equates marginal return, a positive rent n* = p*(p t {l*) - w*l* obtains for each 
resource. An alternative way to implement this allocation is to set an access fee n = n*ll* 
for resource i = A, B and leave laborers choose where to employ themselves (after paying 
the fee). 

The privatization equilibrium coincides with the efficient allocation. Indeed, welfare 
is W = Jq° D ' 1 {x)dx- f ° S _1 U)dx i.e., the aggregate WTP of produced units minus the 
aggregate opportunity cost of used labour. Given the market clearing equations, the FOC 
of efficient labour input for resource i is D'H^o) -§t = S^{L) which implies the previous 
FOC (pAli) = 77 since S{w) = l Q and D{p) = q . Notice that total output is maximum at the 

I p 

efficient allocation. If not, there would exist a labor vector {I a, h) such that Z^ + /# < Ia + ^b 
(weakly cheaper) while output would be greater i.e., qA + tfB > ^*a + c Ib- If so > i n at least one 
resource, we have ni = p*(p t {li) - w*U > p*0 2 -(Z*) - w*l* which is a contradiction to profit 
maximization. 

We now show that laborers working at a resource earn less under privatization than 
under free access i.e., w* < w. If, on the contrary, iu < w* is true, then labour supply is 
lesser i.e., S{w) < S{w*), thus, in at least one resource i, we have less employment i.e., 
It < l* and as a consequence of DRS a greater average return i.e., - L J- < ^p-. As shown 
before, the privatization equilibrium maximizes total output, thus involves a lower price 
i.e., p* < p. Since resource i fetches a rent under privatization, we have w* < p* -U- < 

i 

p^p- = w because profit is nil under open access; this is a contradiction so that w > w* 

n 

must hold true. 

We have thus shown that free access guarantees a higher wage to every laborer work- 
ing in the resources and also attracts more laborers to the commons. 16@ It is no surprise 
then that if the proceeds of privatization are captured or unevenly distributed, there will 
be a majority of laborers opposed to the reform even though it creates wealth. 17@ In the 
case of congested networks, this political economy result indicates that access should 
never be free at times of peak demand (in order to eliminate congestion) and that all po- 
tential users should get free passes or rebates in order to compensate them for the lower 
utility they derive from the positive peak price. 

Restoring Efficiency 

We have seen that in a regime of open-access, commons are over-exploited. To restore 
efficiency, we must devise ways to limit the activity of firms. One well known possibility 
is to tax output or edict constraining regulations. 18@ This way, the extractive technology 



worsens and become dearer to operate; firms are thus rationally lead to extract less. 
This, in turn, increases the economic benefit generated by the CPR (higher efficiency). 
Unless there is an explicit mechanism to redistribute the tax revenue, profits are likely 
to fall with the tax. Indeed, the loss of revenues due to taxation is a direct effect while the 
gain due to higher efficiency is an indirect one and it is rare to see the latter overpower 
the former. 19@ It is thus necessary to channel some the newly created wealth to firms in 
order to compensate them and win their support for the tax policy. 

An example is the grandfathering of tradable quotas for carbon emissions. 20@ In the 
same context of carbon taxation, Stoft (2008) proposes the "untax" whereby the entire 
tax revenue is refunded to firms in a lump sum manner (or at least in a way that is non 
correlated with the extraction rate of each firm). The formal mechanism achieving this 
objective is presented in §7.2.4. 

18.2 Renewables Resources 

18.2.1 Introduction 

Sustainable Development 

Our everyday actions have a very small impact on the environment and the global 
amount of natural resources. Yet, once aggregated at the earth level, they become signif- 
icant. These phenomena were not relevant in the past but it is now clear that pollution 
and waste of renewable resources or the extraction of exhaustible ones have very costly 
consequences both in monetary and utilitarian terms. After ignoring them altogether, 
economic theory has started to address these issues by treating them as negative exter- 
nalities. 

Sustainable development thus refers to economic development that would not endan- 
ger the development of our ecological container, the earth. The constraint being difficult 
to measure, one generally associate an ad-hoc external cost to every damage done to the 
biosphere. 

Biological and Flow Resources 

In economics, a natural resource is said to be renewable if its stock has the ability to grow 
(regenerate) significantly within a human generation. 2 1@ We distinguish two categories: 
biological as with animals and plants or flow resources such as solar, wind, waves, tides, 
streams all of whom derive from the sun's activity. A renewable resource thus grow by 



"the gift of nature" whereas physical capital (buildings, machinery) or human capital 
depreciates with time and can only grow by applying human effort. 

Whenever a biological resource (crop or animal) grows on a piece of land under the 
private property regime, it has a well defined owner who will realize the asset (harvest 
or slaughter) when it reaches commercial maturity which may differ from the biological 
one. In that case, the asset is akin to a standard good produced using capital, labour 
and consumables. The one exception is forest because trees grow slowly so that it may 
be more profitable to cut a young tree rather than wait more years to harvest more wood 
out of it. 

Except for fish farms, fish (and other aquatic beings) live in the open sea (or rivers) 
so that the private property regime does not applies since it is quite hard to delineate 
frontiers on sea. 22@ There is thus open-access in the sense that anyone is free to extract 
from the population at any level of effort. Gordon (1954) relates this characteristic of 
fisheries to the problem of excessive extraction and stock depletion (cf over-crowding in 
§18.1.4). 

18.2.2 Stock, Flow, Extraction 

We view a biological resource as an asset and its stock as capital. We consider the law 
that governs the evolution of its stock as well as the interaction with human extraction 
from the stock. 

Natural Evolution and Equilibrium 

The canonical model of intertemporal capital accumulation evolves according to (18.2) 
where k is now the stock of the natural resource, s the net periodic change, O the absolute 
natural net 23@ growth and q extraction. 

We define the natural net rate of growth as p{k) = ^jp. If left to itself (i.e., q = 0), the 

stock evolves according to 

k 
k = s = ®(k) o - = p(k) (18.8) 

/c 

When the rate of growth is a constant p, the solution to (18.8) is k t = k Q e pt where k 

is the initial stock size (indeed, e° = 1). 24@ The problem with this approximation when 

p > is divergence towards infinity in the long run. 25@ It is obvious that since the earth 

is of limited size, no living population can extend indefinitely. For instance, a fishery 

cannot grow infinitely because the lake or sea has a finite size and is therefore home to a 

finite amount of food (for the fishes). The net growth rate must therefore be decreasing 

with the total stock A; to reflect the environmental limitations to growth; we speak of 



compensation. 

A frequently used model is Verhulst (1838)'s logistic equation stating that the ratio 
of current to maximum growth rate is the degree of freedom of the population i.e., ^- - 
^p where p is the underlying (or maximum) growth rate and k the carrying capacity. 
Normalizing the unit of measure for the stock such that k- 1, we have 



p{k) = a-k)p 



(18.9) 



so that (18.8) has now two obvious long term population equilibria (steady states solving 
k = 0), namely disappearance and the carrying capacity the latter being asymptotically 
stable while the former is unstable. The exact solution to (18.8-18.9) is 26@ 



l-i-fl-i).-^,, -1 

k t Uo ) i + lzJso e -pt 

K0 



(18.10) 



The curves for constant growth rate and compensation are shown on Figure 18.1. 




Figure 18.1: Population Dynamic and Extraction 



Extraction and Effort 

Let us now introduce extraction q from the stock as the aggregate extraction of all firms 
exploiting the resource. A steady extraction solves s = k = in (18.2), thus q = O(Ar). 
In the logistic case (18.9), we find 



q = p{k)k = (l-k)pk => fc=-(l± v / l-4^/p) 



(18.11) 



Of these two solutions k< k shown on Figure 18.1, the former is unstable and the latter 
asymptotically stable. Indeed, ^ < as soon as k t is outside the roots. Hence, if at some 



point in time, for an unknown reason, we have k t < k, then population will decrease 
and irremediably converges to zero. In the middle range, k<k t <k^>^j>0so that 
population tends to the larger root (lim +00 k t = k). Lastly, if k t > k happens, then the fact 
that ^f < holds true drives back the population towards the upper root (lim +00 k t = k). 
Summarizing, the long term population equilibria are extinction and a positive steady 
level. 

A sustainable yield or steady output is an extraction rate q = O(fc) associated with 
a steady stock k i.e., small enough to enable the stock to maintain itself at a steady 
level forever. The maximum sustainable yield (MSY) is q = maxjtO(fc) = | which stabilizes 
population at k = ~ (50% of the carrying capacity). As can be seen on Figure 18.1 and eq. 
(18.11), when q increases toward q, the roots k and k move toward k. This also implies 
that an extraction rate larger than q (however small is the difference) leads to the total 
depletion of the stock. 27@ 

We now account for the fact that extraction is a human decision made to fulfill some 
objective. It makes sense to assume that the extraction rate q depends on both the stock 
size k and the aggregate extraction effort L. Schaefer (1957), in the context of fishing, 
proposes q = kL (after normalizing the effort unit). A steady effort L is sustainable only 
if A: = in (18.2) i.e., 

kL=q = ®(k) = kp(k) = k(l-k)p o k L = l-- (18.12) 

P 

with L < p to guarantee a positive solution. The sustainable output is then 

q L = k L L = L\l--] (18.13) 

reaching its maximum, the MSY, at L = ~ . Notice that L> p does yield more than the 
MSY in the short term, but over the long term, such a policy irremediably depletes the 
entire stock. 

Depensation The reproductive technology of the population displays depensation if 
p{k) is increasing over some initial interval as shown on Figure 18.2; it means that the 
underlying population reproduces itself faster and faster (in this range). The population 
equation (18.12) may then have a large stable solution k 2 and a smaller unstable one fci 
such that if population drops (for whatever reason) to some level k < k\, the population 
will converge toward zero. 

This model also depicts an hysteresis effect. If effort L > L is achieved (in the absence 
of regulation) then the population will quickly decrease. A regulation may reduce extrac- 




k i r< k 2 

Figure 18.2: Population Dynamic and Depensation 



tion effort back to a lower L but this is not sufficient to ensure that the system returns 
to the level k 2 that enables a sustainable yield; an additional reduction in effort down 
to L is required because the population has fallen into a perishing trap (below A^). If 
the curve binds too much at zero, then once the population has fallen below the critical 
depensation, it will inevitably come to extinction. 



18.2.3 Static Equilibrium 

We now use the previous steps to analyze the equilibrium stock according to a variety 
of human behaviors i.e., whether extraction is a competitive, monopolistic, oligopolistic 
or socially planned activity. We first look at a crude static optimization where all actors 
seek to maximize a steady income (net annual revenues). 

We assume that extraction efforts provided by active firms are perfect substitutes, so 
that only the industry aggregate matters for assessing the evolution of the stock. Using 
the previous characterization of the extraction technology, the industry's cost function is 
C{k, q) = wL = ^- for q < q where w is the (non normalizable) opportunity cost of a firm. 
Denoting p the market price, profit is 



n = (p Q -^)qoc[p-l)q 



(18.14) 



where p = ^ is the relative price of the output. 



Efficiency 

Welfare is W{k, q) = U{q) - c{k,q) for q < q where U{q) = J^ P{x)dx and P{q) is the social 
WTP for output usually derived from the market demand. Since each (steady) output is 



associated with a small and a large (steady) population, it is preferable to go with the 
large one to reduce cost. Now, if the output of the resource under study is small wrt. the 
world output then WTP computed at the local MSY P{q) is likely to be greater than unit 
cost, hence welfare is an increasing function of output. Insofar as we aim at maximizing 
income while maintaining stock constant, efficiency commands to extract at the MSY. 

Oligopoly 

The oligopolistic interaction over a common pool resource is a contest treated in §7.2.3. 
The FOC for profit maximization, (7.8), is a weighted average between marginal and 

average benefit. 

1 n-ln ,..„ „_ x 

= -n m + (18.15) 

n n L 

This enables us to identify the monopoly with n = 1 (maximizing n) and open-access 

with n = +oo (solving for n = 0, the free entry condition). Using (18.14), the solution to 

(18.15) is L" = T^j-^-p and total effort at the Nash equilibrium is 

l^JL^Ez1 p (18 . le) 

n + 1 p 

leading to steady stock k n = 1 - y using (18.12). Lasltly, using (18.14), the equilibrium 

profit simplifies into n n = {pk n - l)L n - p ~ 2 P . Industry profit is thus U n = nn n and it is 
a matter of algebra to check that the ratio of oligopoly to monopoly profits is 4 " 2 as in 
the standard Cournot model (cf §5.12). 

Notice that no extraction takes place if p < 1 since it is not worthwhile, independently 
of the industry structure. If the absolute output price po rises or technological innovation 
reduces cost w, then the equilibrium extraction increases. 

Monopoly 

When the resource is appropriated and successfully managed by a single decision maker, 
its economic value can be maximized. As can be checked from (18.16) for n = l, the mo- 
nopolist's optimal effort is L s = ^-p leading to a steady stock of k s = ^— and a maximum 

economic yield (MEY) q s = ^rP < q = f , the MSY. We may thus conclude that the static 
monopolist refrains from extracting too much. Notice that when either the output price 
Po rises to infinity or effort cost w vanishes, the MEY converges towards the MSY from 
below (since p diverges). 



Open-Access 

When the resource is under an open access regime, any firm or entrepreneur is free 
to extract from it. If furthermore, there are many small firms then we can assume 
competitive behavior. The process of entry and exit will then drive the individual firm 
profit towards zero. 28@ As can be checked from (18.16) for n = +00, total effort 29 ® is twice 
the monopolist choice i.e., L° = 2L S ; it leads to a steady stock k° = K Over-extraction i.e., 
L° > L, is bound to happen when the relative output price increases indefinitely because 
L° -» 2L; this has been the case in the history of most fisheries as shown early on by 

p— 'CO 

Gordon (1954). 

To counter such a negative outcome, a cooperative reduction in effort from L = A to 
L = B, as shown on Figure 18.3 would lessen cost and increase yield in the long term. 
The problem is that the short term effect of a decrease in effort is a fall of the catch; it 
lasts for the time necessary to rebuild the fish population that will later enable a more 
comfortable catch for less effort. This poses the problem of weighting the future (discount 
factor) to compensate the fishermen for their temporary losses. 
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Figure 18.3: Effect of a regulation 



The solutions to restore efficiency in the use of the resource are difficult to implement. 
If the government limits entry using quotas for fishermen, it may reduce catch in the 
short-term. For once, this benefits only incumbents because the price is driven up by 
the limited supply. The main problem thought is that incumbents will quickly increase 
their fishing capacity because the forces that drive over-fishing have not changed at all. 
Thus, an outcome identical to the open-access equilibrium takes place. Furthermore, 
incumbents are likely to waste financial and human resources to lobby the government 
to maintain the barriers to entry in their activity. The quota must therefore apply to 
boats (measured by their weight) rather than businesses. 

An important policy implication of the open-access regime is that firms (or individu- 
als) who stay in the business have the lowest costs, thus the lowest outside opportunity 



for doing another job leading to significant social problem when trying to regulate this 
activity. Notice that only infra marginal firms (those with lower cost) obtain an economic 
rent from extraction. 

18.2.4 Dynamic Equilibrium 

The analysis so far looked at steady incomes obtained by maintaining a steady stock. 
Any economic agent exploiting the resource prefers current to future income and is thus 
tempted to extract more right away. Following the initial effort of Scott (1955) and Clark 
and Munro (1975), we flesh out this intuition in our simple set-up. 

Monopoly 

Since the objective is linear in stock, we can concentrate on the steady state version of 
(18.3) stating that the discount rate of return must equate the renewal rate plus the 
stock effect: 

r = O fc + — (18.17) 

Using n = [p-\)q from (18.14) and q = O(fc) = kp{k) = k{l - k)p (cf 18.9) at the steady 
state, (18.17) becomes 

(l-k)p/k l-k /,„.,„x 

r = p(l-2k) + - ^— o r] = l-2k+ (18.18) 

p-l/k pk-l 

where r\ = - is the bionomic growth ratio. Re-arranging (18.18), we obtain the following 
quadratic equation in A; 

-fl + --2A:) = l-— (18.19) 

t] \ p I pk 

where the LHS is decreasing linear while the RHS is an increasing hyperbola. The 
intermediate value theorem then applies to yield a single positive solution k v since 
LHS\ > > -oo = RHS\ and LHS]^ = -oo < 1 = AHS^. 30 ® Note also that the price must 
be greater than unity for otherwise the RHS and LHS would have opposite signs. Given 
this geometrical characterization, the comparative statics are quite obvious: the optimal 
stock is decreasing in both the bionomic growth ratio and price since the LHS falls as 
either r/ or p rise while a rise in p pushes the RHS towards its limit. 

We can relate the optimal dynamic rule k^ to the static one, k s and the open-access 
one, k° by observing that (18.19) o 2k(k s -k)= r}{k-k°); thus we have k s = k Q > k v > k +OQ = 
k°. Hence, the extremes are when one cares exclusively for the future (r = 0) or for the 
present (r = +oo). This identification means that the optimal policy in a dynamic setting 



gears towards one or the other according to the discount rate of the decision maker. 31 ® 

The long term yield q n = O [k^] is depicted on Figure 18.4. Several curves are depicted 
according to the bionomic ratio. For r <k p (i.e., q « 1), extraction increases with price. 
At the limit where r = 0, it converges to the MSY for an infinite price. Now for r close 
or larger than p, extraction rises with price up to the MSY at k n = \ o p = 2 + i (cf. eq. 
(18.19)) and then start to decrease as a consequence of over-extraction triggered by a too 
attractive price. If the demand is elastic and large as with Di then the equilibrium price 
is high the consumer surplus small because there is over-extraction. It is only for a low 
demand D 2 that a more traditional and efficient equilibrium is obtained. 
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Figure 18.4: Economic equilibrium 



Public Management 

If we can neglect the externalities linked to the extractive process, welfare in this setting 
is the gross utility from output minus the extraction cost i.e., W{k, q) = U{q) -c{k, q) where 
U{q) = $ P(x) dx and P{q) is the social WTP for output derived from the market demand. 
Since the social planner maximizes the NPV of W instead of n, (18.17) becomes r = 
®m + IJr- Now, we have W q = P{q) - c{k) as U'{q) = P{q) and W k = n k , hence the only change 
wrt. (18.18) is to replace the market price p by the WTP P(<D(fc)). Graphically, the social 
optimum is found on Figure 18.4 by intersecting the plain curve (q > 0) with the demand 



curve. 

Regulation of the open-access equilibrium through a Pigouvian tax or transferable 
quotas is possible in theory but hard to compute as it draws on the knowledge of the 
natural growth rate p. Furthermore it is hard to implement at the political level since 
the government captures all the economic rent generated by the tax or the auction of 
quotas. 32 ® Technically, the efficient steady output q* = 0(A;*) (cf. (18.2)), if computable, 
is the global amount of transferable quotas that ought to be awarded or auctioned to the 
industry. Since the demand for quotas decreases with their price, the intermediate value 
theorem guarantees that there is a quota price making the overall demand equal to q*; 
this price is the sought after Pigouvian tax. 

Depletion 

Observe that the stock is never fully depleted in the current model. Yet, as soon as 
the cost of extracting the last unit is finite, open access leads to this fate for a large 
enough price as shown by Clark (1973). Indeed, for the unit cost c{k) = ^, the free 
entry equation equation n = o p = c{k) has solution k = | -e which becomes negative 
when the price overshoots lie. The same fate occurs if the rational forward-looking 
monopolist has a strong preference for the present (case where the LHS of (18.19) tends 
to zero). Depletion however never takes place under rent maximization because the 
optimal choice solves ^— = p{k + e) and tends to one half of the carrying capacity when 
the price grows indefinitely. 

Dynamic Oligopoly 

Levhari and Mirman (1980) analyze a dynamic duopoly interaction under the restriction 
that firms play Markovian strategies i.e., at each period, their extraction depends only 
on the remaining stock and extraction choice of the competitor. These authors consider 
a mathematically convenient model: the natural resource grows geometrically at rate 
a < 1 around its long run natural equilibrium (normalized to unity) i.e., K t +\ = Kf; this 
ad-hoc specification has an embedded stability i.e., when the stock leaves its long term 
level, it is drawn back to it. We first take a look at the monopoly case. 

Monopoly A firm with discount factor 8 is granted the right to exploit the resource for 
T periods. Assuming decreasing returns to scale for the market where the output is sold, 
we can use the logarithm In as a valuation function. 

Consider first T = 2 and denote K the initial stock, a the immediate extraction and b 
the later one. The NPV of this extraction strategy is U = In a + <51n b o e u = ab s . Given a, 



the stock for the last period is {K- a) a and it is obviously optimal to extract everything 
in the last period, hence b= {K- a) a . We then have e u = a{K- a) aS , so that the optimal 
initial extraction is a* - — where y = 1 + a8 ; we then recover e u * oc - . 

The result readily extends to any finite number of periods. Indeed, define y T = 1 
(complete extraction is optimal in the last period), y t = 1 + ady t+l for t < T and make the 
induction hypothesis that a* = y and that e Ut oc I — \ ' for t > 1. The stock at the beginning 

of the second period is b = [K- a) a and by the induction hypothesis e U2[b] oc l — \ \ Since 

U\ - \na + 5U2{b), we have e Ul oc a\ l — \ 2 oc a{K-a) aSj2 so that the optimal extraction is 
^* = Trikr = & which proves our claim. 

We perform the economic analysis of the optimal behavior for a large duration T. 
Solving for the recursive equation, we obtain the periodic optimal rate of extraction ^ = 
v- - 1 /xfZ-t+i whose behavior for large T depends on the sign of 1 - a8. The capitalist 

It l — ((xo) 

owner of the monopoly sets 6 = j^-p where r is the market interest rate. For a biological 
resource like a fishery, we have a = 1 + p where p is the growth rate of the population. If 
the natural resources grows so fast to the point that p> r holds true, then a5 > 1 meaning 

that -7- is initially very small i.e., it is better to wait for the stock to grow and extract a 

it 

lot in the future. If the resource is fossil p = or grows slowly (p < r), then aS < 1 and in 
that case the optimal extraction behavior is to extract a percentage -J- - r - p of the stock 
at every period. It is only when the final period approaches that depletion starts i.e., the 
extraction rate comes close to unity. 

Duopoly Since in the last period everybody wants to extract everything, we need to 
set a boundary condition. The most natural one is equal sharing with a^j = K T I2. Given 
stock K t and challenger extraction a- ]tt , the available stock is K t - a,j tt , thus we may use 
the previous induction reasoning to prove that the best reply is at <t - 1+g g. y J ' f where 6t 
is the individual discount rate used to compute y it . Using the symmetric formula for the 
other firm, we obtain a system characterizing the equilibrium; its solution is 
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for T large. 

The import upshot here is that the standard Cournot result carries one: the re- 

a*. T +a*. T 

source is over-exploited with respect to the behavior of a monopoly. For 8i = 6j, '' K - = 



i-a-'gfz > 1 - a<5 o (l - a8) 2 > 1 - 2a/3 which is true. Obviously, the two firms may form a 
cartel and sustain the efficient extraction path by the threat of reversion to the above 
stationary strategies in case of defection (aka. trigger strategy). 



PartH 



Incentives and Information 



Information Economics 

As recalled by Laffont and Martimort (2002), for centuries, farmers have labored the 
fields belonging to their landlords, apprentices have worked under the orders of their 
master craftsman. The motivations behind these behaviors are not obvious to guess. For 
long, authority was seen as the major explanation but with the development of trade and 
industry, the economic motivation came to play a role. The theoretical debate is launched 
by Smith (1776) who criticizes sharecropping, the metayage system, where the farmer 
and the landlord share evenly the harvest because it acts like a tax that inefficiently 
reduces the incentive of the farmer to apply labour on the land. Hence, whenever the 
landlord can not monitor the work of the farmer to enforce the adequate amount of effort, 
we have a clear example of what is now called "moral hazard" because the farmer will 
shirk or cultivate his personal vegetable garden. 

Task delegation occurs because the landlord has a limited time and ability but above 
all because the division of labour and the specialization enable great economies of scale. 
Given the complexity of most productive activities today, delegation goes along with the 
loss of supervision, the inability for the landlord to monitor or control the activities of the 
farmer. Being left alone performing his task, the farmer learns more than the landlord 
about all economic aspects of the productive activity he is involved in; he acquires private 
information. Furthermore, the very existence of delegation, means that the farmer is 
more or less able to orient the activity as he wishes. 

The modern theory calls principals those who pay, agents those who receive payment 
in exchange for their effort. To ease exposition, we systemically refer to the principal 
as "she" and to the agent as "he". Their relationship form an agency and whenever 
one party holds (relevant) private information, there is asymmetric information. The 
landlord-farmer case is only one among the many situations fitting the formalization as 
shown in the table Table 18.2. 



Principal 


Agent 


Activity 


Information 


landlord 


farmer 


cultivation 


weather 


client 


attorney 


defense 


relation of the case to law 


saver 


broker 


investment 


market opportunities 


stockholder 


manager 


industrial policy 


market conditions 


government 


builder 


public work 


cost 


city 


utility 


water 


technology 


insurer 


policyholder 


health 


genetic risk 



Table 18.2: Examples of Agency Relationships 



Classical authors have long noted incentives as an important issue of social behavior 
but have not made the connection with economics. At the same time, economists viewed 
information as irrelevant to their domain. It is only recently that incentives have been 
understood as an information problem that hinges strongly on economic efficiency in 
many areas. To make sure that the efficient action will be carried out by the agent, 
the principal must elicit his private information and then convince the agent to perform 
that action. The latter part is by no means trivial; as we can check from the examples 
in Table 18.2, the principal can hardly control the effort (e.g., time, money) expanded 
by the agent in the activity he is getting paid for. From an analytical point of view, we 
are facing a problem of hidden information and/or hidden action. The usual terminology 
nevertheless follows the insurance vocabulary because the first theoretical works were 
developed in that field. 

The classical case of hidden information is when risky people try to get maximum 
coverage; insurers speak of an adverse selection because all premiums will have to be 
raised and may discourage those at less risk from insuring themselves. Akerlof (1970) 
introduces the term Adverse Selection when citing a 1964 textbook on insurance "There 
is potential adverse selection in the fact that healthy term insurance policy holders may 
decide to terminate their coverage when they become older and premiums mount. This 
action could leave an insurer with an undue proportion of below average risks and claims 
might be higher than anticipated." 

The typical circumstance for hidden action is when drivers could drive with care to 
avoid accidents but fail to do so once they are fully insured; this behavior is known 
as moral hazard. Health insurance professionals denned moral hazard in the 1960s 
as the "intangible loss-producing propensity of the insured individual" or the "hazard 
that arises from the failure of individuals who are affected by insurance to uphold the 
accepted moral qualities". Pauly (1968) accurately showed that this was no morality 
problem but a simple consequence of rational economic behavior: by spreading the cost 
of my health-care over the entire population (socializing), medical insurance makes this 



service cheaper to me, thus my demand for it increases. 



Chapter 19 

Risk and Uncertainty 



North et al. (2006) recall that humans have always faced and feared risk, to the point 
that many of the practices adopted by primitive tribes can be interpreted as insurance 
mechanisms. Societies have then build market and State mechanisms to smooth risk. In 
this chapter, we review quickly how one models risk and uncertainty with probabilities 
and how the standard theory of demand can be extended to account for randomness. We 
then introduce a basic measure of risk and characterize the way firms and individuals 
adjust their behavior when exposed to risk. We end with some more advanced results 
that are useful for the following chapters. 



19.1 A Framework for Uncertainty 

19.1.1 Introduction 

The concept of asymmetric information builds on the more general idea of incomplete in- 
formation, the fact that economic agents ignore some relevant features and are therefore 
faced with uncertainty and exposed to risk. To illustrate the importance and ubiquity of 
risk and uncertainty Table 19.1 uses data from the US Energy Information Administra- 
tion regarding the volatility of important commodities. Volatility is a statistical measure 
of how often the observed price is far away from its mean i.e., it is an assessment of the 
lack of regularity of the commodity's price. 
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Table 19.1: Spot Market Prices Annual Volatility 

Let us develop the risk concept with a simple example derived from our typical 
oligopoly framework: when a manager fails to observe the marketing strategy of his 
competitor, he lacks a crucial information and therefore faces risk; his strategy will have 
to account for this uncertainty. Although the market outcome is a deterministic function 
of the two strategies used by firm A and B, it is random from manager A's point of view. 
Take for instance the market equation of the Cournot model p = l-qA-c/B- If manager A 
is unsure of what quantity qs was decided upon by manager B, then the price becomes 
uncertain since his own decision q A does not determine completely the final price (it only 
limits the range of possibilities). 

To cope with risk, most people buy insurance e.g., life, car or housing insurance or 
pension plans; the importance of the insurance sector in developed economies attests of 
this general desire to diminish risk (cf Table 1.1). To assess how the presence of risk, 
uncertainty and asymmetric information impinges on economic choices, it is necessary 
to extend the standard theory of demand. 



19.1.2 Time and Money 

Because uncertainty relates to the future and "time is money", we have to account for 
the opportunity cost of time. We can then build the net present value (NPV) to aggregate 



a stream of revenue (or disbursement). This concepts builds on the subjective preference 
for present consumption also called time preference of the investor. 

Once we make the mental experiment of viewing today's good as different from tomor- 
row's, we can transpose the intertemporal allocation of one good between periods into the 
standard microeconomic analysis of choice (cf §2.2.1). Thus, the RMS from today to to- 
morrow should be equated to the price ratio of money at the successive times. Now, if 
money could flow freely between periods, then the latter ratio ought to be unity. Yet, 
in most cases, we require compensation to forgo today's consumption which means that 
one unit today is worth more tomorrow. Note that no market is involved here since the 
consumer is solving a cake-eating problem i.e., he allocates some amount between the 
two periods. This means that the relative price expresses his subjective preference for 
present consumption. This principle valid for two periods readily extend to many using 
a chain reasoning. 

The transposition to finance is straightforward. When an economic agent, whether a 
consumer or a firm, lends or invests 1€ for a period of time, she forgoes an opportunity for 
immediate consumption or alternative use; therefore, she requires a compensation in the 
form of an interest at repayment time i.e., she receives 1 + r where r is the discount rate. 
The market interest rate r is an (objective) average of the (subjective) discount rates of 
all investors. 1@ Typically, an individual will lend (resp. borrow) if r < r (resp. >); if there 
is an excess of lending or borrowing, the market rate adjusts to restore equilibrium. 

The discount factor 6 = ^-p < 1 is the present value (PV) of l€to be paid within one 
period of time. The PV of l€to paid within t periods of time is (1+ 1 r)t ; it is found by 
compounding the periodic discount factor. The time period can be a year, month, week, 
day, hour or minute. The relation between the corresponding rates is found by applying 
compounding. If r a is an annual rate, the associated monthly rate r m is the solution of 
(1 + r m ) 12 = 1 + r a which, in a first approximation can be taken to be r m =* r a l \2. 2@ It is 
often convenient to consider time as a continuous variable. Notice from the previous cal- 
culation that the "per second" interest rate r is so small (in comparison with the annual 
one) that the formula (1 + r) f = e tin{1+r] - e rt becomes exact since ln(l + r) ^r for small r as 
shown by Hotelling (1925) (cf. proof). 

The net present value (NPV) of a series of cash flows y = [yo,yi, y2,—,yr) occuring at 
periodic intervals is then 

NPV (y) = Y<7T IL Tt= y(t)e~ rt dt (19.1) 

f=1 (l + r) r Jo 

in the continuous time version. 



19.1.3 Probability Theory 

Up to now in this book, information was distributed evenly among actors i.e., a relevant 
piece of information was either known to everyone that might be interested by its con- 
tents or known to no-one. The key novelty of this Part on information is that some people 
have private information i.e., know more about some relevant facts than others. From 
the point of view of those in the dark, several alternatives are to be considered and in 
order to form a synthetic opinion, the individual must balance or weight them according 
to some criteria. 

Probability theory is the mathematical tool allowing the extension of economic con- 
cept to this larger world. Acquainted readers can skip this section. 

Experiment 

Consider Ann (a) and Bill (/3) playing a dice game. When the dice rolls on the table, both 
observe its realization, an integer between 1 and 6. However, if the dice falls from the 
table on Bill's side, he will be the only only to know; Ann will ignore the true realization 
and will have to form a belief. The most natural one is to attribute a probability of \ on 
each possible outcome. The rational for this belief is the following experiment: throw the 
dice n = 100 times, count n t the number of times the integer i came out and compute the 
empirical frequency h t = — . The law of large number tells us that if the dice is perfectly 
symmetrical then all six frequencies hi will converges to \ as the experiment size grows 
large (n — +oo). 

Model of the World 

The mathematical modeling of the dice experiment speaks of a random variable x. The 
basic events we consider are the possible outcomes {x = \},{x = 2},...,{x = 6} of the dice 
draw. A more complex event is {x < 3} which is the union (denoted u) of the basic events 
{x = 1}, {x = 2} and {x = 3}. The event {x is odd} is {x = 1} u {x = 3} u {x = 5}. With the AND 
condition (denoted n), we can form the event {x is even and more than 3} = {x £ {2,4,6}} n 
{x > 3} = {x £ {4,6}}. More complex events combine the OR (u) and AND (n) conditions 
with {x < 3} n {x > 4} = 0, for an impossible event or {x £ N} = D., an obvious event, which is 
always true. 

The objective probability that the dice shows the integer i = 1 to 6, denoted hi = Pr{x = 
i) is h in this particular example. More generally, a probability distribution is any vector 
h = {hi,..., h e ) = {h{)i<e such that hi > and E;<6^ = 1- The cumulative of h is H = (H;);< 6 
where Ht= Pr{x< i) = £;<; hf, for instance H 3 is the probability that the dice outcome is 



lesser or equal to 3. We speak indifferently of h or H as the probability distribution or 
law of x since hi = H t - H t -i i.e., each can be recovered from the other. 

Belief 

A probability distribution can be used to model an objective phenomenon but also the 
subjective feeling of those who observe the phenomenon with more or less accuracy. Be- 
cause Ann has no information regarding the dice outcome (the dice fell out of her sight), 
her belief h a is the objective law of x i.e., hf = | for all i < 6 (this is an application of the 
principle of insufficient reason). If the dice shows number 2, then Bill knows that x = 2 
occurred; this leads us to say that he holds the belief h^ degenerated at 2 i.e., h% - 1 and 
Vz ^ 2,h = 0. Imagine now that the faces of the dice are respectively red and green for 
even and odd numbers. If a third person Gail (y) manages to see the color of the dice, 
say red, but not the figure written on it, her belief will be a refinement of Ann's using 
Bayes's rule V i < 6, h] = p '^=i n e ^ en) i.e., h\ = h\ = h\ = 1/3 and h\ = h\ = h\ = 0. The formu- 
lation we have adopted is therefore flexible enough to describe any amount of knowledge 
regarding the outcome of the random variable, from none to all. 

Expectation 

The expected value taken by the dice is E[Jc] = Ej< 6 i*hi = \\ it is objective because com- 
puted using the probability distribution of the random variable (itself determined by the 
shape of the dice and the laws of physics). Each person present in the room where the 
dice experiment takes place forms an expectation of the value conditional on his/her be- 
lief and private information. For instance, Ann has no particular information (denoted 
0) and holds the belief h a , thus she expects E[jt|0] = E;<6*' x hf = \. Bill, who knows the 
exact value taken by the dice, expects E [x\x = 2] = £ ; -<6 i ' * n { - 2 and Gail who knows that 
the value is even expects E [x\x even] = £i<6 ixhj = 4. 

Beyond the simple dice example, we may study a discrete random variable x = {xt, hi)i< n 
whereby the value Xt is drawn with probability hf, its expected value is E[x] = Lj< n Xjhj. 
One speaks of a gamble when the values are monetary rewards (or penalties). Lastly, 
note that when the dice is rolled, it only reveals a face which may be interpreted differ- 
ently by each participant. The same holds true for a gamble since money is not appre- 
ciated in the same manner by all. To be able to use the calculus of probabilities in this 
more general uncertainty framework, we need to assume that each person associates a 
numerical value xt to each outcome (face #0 i.e., holds complete preferences over the set 
of potential outcomes. 



Economics 

Let us now generalize the probability calculus to economic phenomena. It is fair to 
say that the price of a stock tomorrow is a random variable x from today's point of view 
because the interaction of demand and supply in the stock market is the result of so many 
people that it seems impossible to compute exactly how today's price will be affected to 
form tomorrow's. The space of outcomes ranges now from to some large number, say 
1000€. Since the minimum monetary unit is the cent, there are a hundred thousand 
possible quotations; for instance ft 8 ,23 is the probability (from today's point of view) that 
the stock value tomorrow is 8,23 €. Given this very large number of possible realizations, 
it is more handy to assume that x is a continuous variable taking values in U+ = [0;+oo[. 
In that case, hg,23 will be written h{8,23) and becomes the probability that the value lies 
in a small interval centered around 8,23. 

Given our change of perspective, the mathematical object describing a random vari- 
able x becomes its cumulative distribution H where H{x) = Pr{x < x) is the probability 
that the realization of the random variable x is not greater than x. This definition ex- 
actly coincides with that of Hi seen before. More generally, a random variable takes 
values in U and its distribution function H need only be positive, increasing and satisfy 
H(-oo) = and H{+oo) = 1. 3@ For instance, the staircase function H(.) = over ] -oo;0[, 
H{x) = | for x e [|;^[ and i = to 5 and H{.) = 1 over ]6;+oo[ is the cumulative of the 
six-face dice throwing. If H is differentiable (which we shall always assume in this book) 
then h{x) = H'{x) is called the density of the distribution. The expectation of x is then 4@ 

/+oo r- 

xh{x)dx = I xdH(x) 
-oo J 

Given any utility function u, u{x) is a random variable whose expectation is 

/+oo r- 

u{x)h(x)dx= I u{x)dH(x) 
-oo J 

We say that an agent holds information 6 with respect to the underlying phenomenon 
x if he believes the random phenomenon to be distributed according to the subjective law 
H B ; the latter can be interpreted as an updating of the original objective distribution H 
upon learning 0. The expectation of u{x) conditional on the knowledge of 9 is E[u{x) \9] = 
fu{x)dHg{x) i.e., the objective distribution H is replaced by the subjective one Hg. 



19.2 Choice under Uncertainty 

19.2.1 Expected Utility and Risk 

In many instances, there is uncertainty with respect to the future. For instance, one 
of the most important decisions in our life is the occupation we engage in; shall we try 
to become an accountant, a broker, an actor, an astronaut or a civil servant? At the 
time we take the decision, we ignore if we shall succeed, if we shall persist in our choice 
or if we shall end up in an altogether different activity. Likewise, a firm keeps asking 
itself, where to invest, how much to invest or whom to employ. In economic terms, the 
uncertainty that will loom up in the future means that most financial payments received 
by economic agents are of a random nature. Let us start with the first formal study of 
decision taking under conditions of uncertainty. 

St Petersburg Paradox 

The St Petersburg paradox arises from the following game: a fair coin will be tossed until 

a head appears, say at toss #k, the player then receives 2 k €. How much would you pay 

to play that game? Although the stakes are very high, people surprisingly bet a small 

share of their wealth which, at first sight seems paradoxical. 

To see that the stakes are high, let us describe the occurrence of the first head toss as 

a random variable k (it takes integer values). The law of k is given by hk = (|) fc-1 x \ = -k 

as it involves k-l tails followed by one head; it satisfies T.k>i h. - 1 as required to be 

a probability distribution. 5@ The expected gain is E[k] - £ 2 fc % = Efc>il = +oo, yet no 

fc>i 
sensible person would bid all his/her wealth to play that game! 

To solve this conundrum, Bernoulli (1738) follows common sense in positing that 

that any increment of satisfaction Aw is proportional to the wealth increment Au> i.e., 

Au = /3/S.w for some constant j6. Resorting to Aristotle, he also argues that incremental 

satisfaction is inversely proportional to current wealth which is an expression of the law 

of diminishing marginal utility. That is to say, the /3 parameter is not constant across 

wealth levels, it must be inversely proportional to the current wealth w i.e., ft = -^ for 

some constant a. We therefore arrive at the ancestor of all first-order-conditions (FOC) 

used in economics: Au = a^-. Assuming that u is a differentiable function of wealth, it 

reads u'{w) = — and after integration, we obtain u{w) = aln{yw) where y is an integration 

constant that can be eliminated by re-scaling the monetary unit. The plot of u shown on 

Figure 19.1 is also the first graphical representation made in economics; it emphasizes 

the relation between income and utility (only the concave part is relevant). 
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Figure 19.1: Utility of wealth 



Moral Expectation 

Now, just like the monetary value of the gamble k is the sum of prizes weighted by the 
probabilities of these outcomes E [jfc], Bernoulli (1738) proposes the moral value of k to be 
the sum of prizes satisfaction weighted by the probabilities of these outcomes. Thus, an 
individual with initial wealth w who pays p to play the St Petersburg gamble will obtain 
a final wealth of w\ = wo-p+2 k (it is also a random variable) and his utility will be u{w\) = 
aln{w -p + 2 k ); his moral expectation is thus U{w Q -p) = E[u{Wi)] = T.k>i &in{w - p + 2 k ). 
Clearly, if the game was free (p = 0) then U{w ) > Zjt>i ^ln(wo) = aln(u/ )Ljfc>i jf = u{w ) 
(since the probabilities sum to unity) i.e., anyone would love to play for free. Conversely, 
if the player spends all his fortune to play (p = wq) then he expects U{0) = T.k>i ^ln(2 fc ) = 
aln{2)Y.k>i ^k - a In (4) = u(4) i.e., no one with a fortune greater than 4 would give up ev- 
erything to play the game. 6@ By the intermediate value theorem, there exists a price 
p* such that U{wo - p*) = u{wq). This is the satisfaction one gets when abstaining from 
gambling or the willingness to play the game. The maximum fraction of his wealth an 
individual would rationally agree to pay in order to play the gamble is A{w ) = ^-. Using 
a mathematical software, we find A(4) = 100%,A(40) - 16% and A(400) « 2% i.e., wealthy 
people refuse to risk much of their fortune in order to increase it further. 

Von Neumann and Morgenstern (1944) show that expected utility is an adequate 
manner of extending the rational choice paradigm to uncertainty. 



19.2.2 Subjective Risk Measure 

Insurance vs. Gamble 

When consumers display decreasing marginal utility of income, they are risk averse i.e., 
we have E[w(jc)] < u{E[x\). 1@ For instance, if you have wealth w, you'll refuse a fair gamble 
g i.e., satisfying E[g] = (your future wealth is the random variable x = w+g). This behav- 
ior constitutes the main explanation to the widespread demand for insurance. Nonethe- 
less, people continue to bet on sports, buy lottery tickets and invest in speculative shares, 
all of which are unfair gambles. To reconcile this "risk-loving" gambling behavior with 
the "risk-averse" insuring one, Friedman and Savage (1948) propose to draw the utility 
of income as shown on Figure 19.2: it displays risk-aversion (concavity) from zero until 
some level significantly greater than the current income of the individual (say twice); 
from then on, the curve displays a risk-loving attitude towards risk (convexity). 

As can be checked on Figure 19.2, the consumer prefers the certain income x to the 
fair gamble x that amounts to win or lose 5 with identical probability on top of x (so that 
E [x] = x). To understand why a lottery ticket might appear attractive, recall that lotteries 
are gambles with a quasi certain small loss (the ticket price) and a small probability of 
winning a big prize (z on Figure 19.2) that dramatically changes life's opportunities for 
the better which is why the utility at z is so large. It is then possible that the expected 
utility of the lottery ticket draws a point on the chord that lies above the utility curve i.e., 
it was rational to buy it in the first place. This occurs if either the probability of winning 
is objectively not too small or subjectively inflated. This is indeed the most frequent case: 
we tend to place a very high value on his new life style upon winning the prize. Since 
risk loving behavior involves psychological elements, we concentrate on risk-aversion 
which is more amenable to economic treatment (although the two are mathematically 
symmetrical one from another). 

Risk Premium 

Since utility is intangible (the utility function u is not unique), the positive difference 
u (E [jc]) -E [u{x)\ we observe on Figure 19.2 has no particular meaning and fail to measure 
adequately the amount of risk faced by the individual. We therefore develop a monetary 
value to express it. As we can observe, the random income x has a certainty equivalent 
x < E[x] solving the equation u{x) = E[u{x)]; the difference n = E[x] -x is the risk premium 
that the investor would agree to pay in order to avoid the risk associated with the random 
income x. This is a subjective value different from an insurance risk premium which is 
an objective market value. 

If we want to emphasize the initial wealth w wrt. the acceptance or refusal of a 
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Figure 19.2: Utility of a Gamble 

gamble g, we use x = w + g. Upon sliding the wealth by E[g], we may assume the gamble 
to be fair. The solution of E [u{w + g)] = u[w-p) is then the risk-premium p associated to 
the gamble (conditional on the current wealth). Note that our focus on gambles is in fact 
one about insurance since the act of buying insurance is a rejection of a gamble. Indeed, 
with insurance one pays the premium for sure and get full coverage for sure. Without 
insurance, one faces the risk of a large loss or saving the premium. 



Degree of Risk Aversion 

de Finetti (1952), Arrow (1965) and Pratt (1964) define an individual to be more risk 
averse than another if he refuses all gambles that the former refuses. An agent with 
monetary utility function u is called Mister u. Let the index of absolute risk aversion 



(ARA) 8C be p u (x) = - !L rfi- > 0. The following assertions are equivalent (cf proof): 

® Mister v is more risk averse than Mister u. 

® v is a concave transformation of u. 

® Mister v's ARA index is larger than Mister u's (p v > p u ). 

® Mister v's risk premium is larger than Mister u's {p v > p u ). 

A DARA is a person whose ARA index is decreasing i.e., p' u < 0. It can be shown that 
p' u < o p' u < (cf. proof). A CARA is person with constant ARA index p; her utility 
function is u p {x) = -e~ px for p > (up to an affine transformation). She is insensitive to 
wealth because E[-e-P {w+ 8 ] ] >E[-e~P w ] o E[e - ^] < 1. 



The relation between the index of risk aversion and the risk premium can be pin- 
pointed for a CARA person if the random income is normally distributed with mean m 
and variance a 2 , denoted x~~> jV {m,a). An important property of the normal law is that 
t[e x \ =e m+ 2 a . Upon observing that -px~~> jV[-pm, pa), we deduce that 

E [u p {x)\ = -E [e- p ~ x \ = - e - m P +S ¥ = u p {m- \pa 2 ) (19.2) 

meaning that the risk premium is exactly \po 2 . The ARA index p therefore represents 
twice the "risk premium per unit of variance" (for infinitesimal risk). The previous for- 
mula also tells us that an agent with CARA preferences facing risk normally distributed 
aims to maximize p- \po 2 . 

An individual displaying constant absolute risk aversion p and whose final income is 
a random normal variable with expectation m and standard deviation a act so as to 
maximize m- \po 2 . 

An IRRA person has an increasing relative risk aversion index i.e., xp u {x) /o -^f < 
p' u . Saturating the constraint defines the CRRA minority; their generic utility is u r {x) = 

1-v 

j— for 7 ^ 1 and U\(x) = log(x). In order that the individual displays risk-aversion, u must 
be concave i.e., j > 1. A larger parameter indicates more risk aversion. 

19.2.3 Objective Risk Measure 

Clearly, risk attitude depends on one's risk aversion but also on the underlying risk of 
the proposed gamble or investment. The notion of "risky investment" is central in the 
financial sphere, both to private decision makers and public regulators. Yet, until lately, 
the concept has escaped a precise definition for existing candidates such as value-at-risk 
(VaR) fail one or several coherence tests. 

Rothschild and Stiglitz (1970) introduce the concept of first and second order stochas- 
tic dominance (resp. FOD and SOD) as follow: 9 > 9 if a large result is more probable 

FOD 

under 9 than under 9 and 9 > if 6 is constructed out of 9 by replacing a value with 
a random variable whose mean is that value. The authors show that FOD amounts to 
unanimous ranking of gambles by all increasing utility functions while SOD amounts to 
unanimous ranking of gambles by all increasing concave utility functions (cf proof). In 
other words, any person with monotonic preferences (the more, the better) would pre- 
fer an investment that FOD another and among those risk-averters would all prefer an 
investment that SOD another. These comparators are thus blameless but at the same 
time so demanding that few distributions end being comparable. 9@ The recent works of 



Aumann and Serrano (2008) and Foster and Hart (2009) solve this issue. We present 
in an intuitive fashion their riskiness measures we choose to call after de Finetti and 
Bernoulli for reasons that shall soon become clear. 10@ Both are homogeneous (the risk- 
iness of twice the gamble is twice larger), sub-additive (the riskiness of the sum of two 
gambles is less than the sum of their riskiness), convex and monotonous wrt. stochas- 
tic dominance (FOD and SOD) i.e., a gamble paying more or with less dispersion is less 
risky (cf. proofs). 

In connection with the above comparability problem, Hart (2010) proves that the two 
riskiness measures each define a complete order over gambles that extends the partial 
orders of stochastic dominance when investors/gamblers belong to a natural family of 
risk-averters defined by Arrow (1965) as follows: acceptance of a gamble increases with 
wealth but decreases with relative wealth (if both gamble and wealth increase by the 
same percentage). 11@ Our intuitive presentation will use the fact that the first property 
is equivalent to DARA and the second to IRRA (cf. proof) i.e., risk-averters descend from 
DARA fathers and IRRA mothers. 

The de Finetti measure of riskiness gives precedence to the DARA property and fo- 
cuses on the people insensitive to wealth, the CARA minority (first investigated by 
de Finetti (1952)). Given a gamble g with losses and gains, 12@ the solution p g to E[e~^ p s] = 
1 is, by construction, a CARA index. 13@ Say that Spirou, Gaston and Fantasio are CARA 
guys with ARA indexes smaller, equal and larger than p . Then, Fantasio refuses the 
gamble, Gaston is indifferent and Spirou accepts it, independently of their wealth. 

Now consider Seccotine whose ARA index is p M (.). When her wealth is w* solving 
p u {w*) = p g , she is locally like Gaston. If w < w*, Seccotine is locally similar to Spirou, 
while for w> w* , she is more like Fantasio. So, may be, Seccotine should behave like hes 
alter-egos i.e., refuse the gamble when poor, accept it when rich and be indifferent when 
her wealth is exactly w*. Finally, let us examine the people most sensitive to wealth i.e., 
the CRRA minority 14@ Member Chiara has an ARA index p y {w) = jlw. According to the 
previous choice rule, Chiara ought to refuse the gamble when her wealth is w <yl p . 
So, a prudent rule, one that all the Chiaras would abide too (Vy > 1), will be to refuse 
the gamble if wealth does not reach the $ value (p g = l/p g ; it is in this sense that we 
may deem it an objective measure of the gamble's riskiness. In the case of a normal risk, 

2 

an extremely simple formula arises: <p g = |- as seen from eq. (19.2). Observe lastly an 
interesting independence property: if two independent gambles have the same risk, their 
sum also. 15@ In other words, if you go to a restaurant and consider two wine bottles from 
distinct regions whose characteristics are unrelated (uncorrelated) and which happen to 
be equally risky, then if you are ready to go for one, you may order both without taking 
more risk. 



The Bernoulli measure of riskiness gives precedence to the IRRA property and fo- 
cuses on the people most sensitive to wealth, the CRRA minority. Among these, the 
boldest is Daniel (7=1) who refuses the gamble g as soon as his wealth is lesser than <p 
solving E \og[(p g + g] = log[0 g ]. 16@ Other CRRAs, being more worrisome, will thus refuse 
the gamble if their wealth is inferior to 0„. To say something about the other people's 
attitude toward the gamble, note that they are less wealth sensitive i.e., when their 
wealth increase, they embolden themselves but not so much as CRRA people. So make 
the following mind experiment: CRRAs used to be poor and have now achieved wealth 
cp , yet they still prudently refuse the gamble. Any other person that was equally poor 
and has become equally rich has nevertheless kept an higher ARA index, thus she re- 
mains more risk averse than a CRRA fellow; therefore she will refuse the gamble too. 17@ 
The Bernoulli measure can be computed exactly for binary gambles where one can either 
lose L or win the larger B. It solves fl + |fl + |=l i.e., <p = -^. If we reinterpret this 

gamble in term of its size S and mean eS, then L = {l-e)S and B = {l + e)S so that (p„ = S^ff- 
i.e., is inversely proportional to the percentage wedge between loss and gain. When the 
gamble's distribution has a thin negative tail, the Bernoulli measure tends to the maxi- 
mum loss. 18@ This makes the measure difficult to apply for laws such as the Normal who 
have no lower bounds. 

Figure 19.3 provides a numerical comparison of the two risk measures: consider a 1$ 
lottery ticket to win a prize. For a large prize, riskiness is a decreasing concave function 
of the winning odd, slowly converging towards zero. Conversely, given a small winning 
odd A, riskiness is decreasing concave with the prize; it rapidly converges. The bold 
upper curve is the de Finetti measure, the lower one is the Bernoulli while the expected 
gain is the dashed line. The two measures behave similarly although the de Finetti one 
displays a greater prudence towards risk. 19@ 

From an investment point of view, the greatest risk is losing the entire capital while 
unexpectedly high rates of return are treated more like bonanzas. Figure 19.4 shows 
the comparative risks of an investment of 100€ which yields on average a 50€ return 
although there is a probability e e [0; 1/2] of either doubling the initial outlay or losing all 
of it. We observe that, e vanishes, the Bernoulli risk measure quickly converges towards 
the initial outlay whereas the de Finetti one keeps decreasing as risk vanishes; it even 
becomes lesser than the expected return although never nil. It thus appears that the 
Bernoulli measure captures better the risk of gambles 20@ while the de Finetti one is 
more informative for risky investments. 
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Figure 19.3: Comparing Riskiness measures over a gamble 
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Figure 19.4: Comparing Riskiness measures over an investment 

19.3 Firm Behavior under Risk 

Firm and Risk 

Large profits are highly valued by the stockholders of a firm but they do not necessarily 
lead to great monetary rewards for the managers or great burst of pride. On the contrary, 
great losses can lead to destitution or bankruptcy which are very negative outcomes for 
managers. It is therefore plausible to assume that the representative manager of a firm 
has a utility function that is increasing with profits but concave i.e., displaying risk- 
aversion. On top of maximizing profits, one would expect managers to act so as to reduce 
their risk exposure. We now explain why this is hardly feasible. 

The two classical instruments to eliminate risk are negative correlation (do not put 
all your eggs in the same basket) and the law of large numbers (try to manage many 
baskets). Regarding correlation, the very existence of a firm originates in the expertise 
and know-how of its creator regarding a particular sector of the economy. Its activities 
take place in the same region or the same markets; thus, they are subject to the same 



exogenous shocks. In statistical terms, all the revenue generating activities of the firm 
are positively correlated which means that profits will be highly variable across time. As 
for the law of large number, non financial firms tend to engage into a limited number 
of projects that are commensurate with their current size (as measured for instance 
by liquid assets); this is done to take advantage of scale economies and achieve a high 
profitability 21 ® 

Behavioral Consequences 

Sandmo (1971) studies the consequences of the manager's risk-aversion upon his pro- 
ductive behavior within the framework of the neoclassical theory of the competitive firm. 
Firm's profit is n = pq - C{q) while the manager's objective is u{n) where the utility func- 
tion satisfies u < < u'. In a risky world, the market price is a random variable p with 
mean p that is determined after the production choice q. 

The benchmark corresponds to either risk neutrality (u linear) or absence of risk 
(constant price); in that case the optimal production is q* solving p = C m {q). 22@ In the 
general case, the manager maximizes E[w(#)] where E denotes the expectation operator 
corresponding to the manager's belief regarding the distribution of p. The FOC of maxi- 
mization in quantity is 

= E[u'mip-C m (q))] 

oE[u'mp] =E[u'mC m (q)] =C m (q)E[u'm] (19.3) 

because the marginal cost depends on the deterministic quantity chosen by the manager. 
Observe now that the covariance of p and u'{n{p)) is negative 23 ® because n is increasing 
and u' is decreasing. If so then the expectation of the product is less than the product of 
expectations: poE[u'{n)] > E[z/(7r)j5] = C m {q)E[u'{n)] by (19.3). We have thus shown that 
po > C m at the optimum which proves that the optimal quantity under uncertainty is 
lesser than the certainty equivalent q*. To conclude 

| Risk exposure induces a competitive firm to reduce her output. 

A more definite conclusion can reached for the case where the manager's risk aversion 
is constant (p) and the market price is a normal random variable with mean p and 
variance a 2 . As we already showed in formula (19.2), the manager then maximizes 
U(q) = E[fi] - \p\J[n\. We already know that E [7r] = p q - C{q) while it is easy to check 
that V [ii] = a 2 q 2 , thus U(q) = p q - C{q) - \pa 2 q 2 and the FOC for maximization is 

p = C m (q) + pa 2 (19.4) 



as if the real marginal cost was inflated by the risk premium pa 2 . 

A last observation is that production will really takes place only if the expected utility 
E[u{ji)] is greater than u{-F), the utility level in case of zero production. Now, since u is 
concave, Jensen's inequality (cf footnote 26.3) tells us that t[u{n)\ < u{E[n]) = u{fi) thus, 
n > -F => p > ^p-. We conclude that 

I A competitive risk averse firm to enter a risky market only if it obtains some economic 
or extraordinary profits which can be interpreted as a required risk premium. 



19.4 Advanced Topicsf 

Optimal Amount of Risk 

We would like to know how a consumer reacts to changes in wealth and what distinguish 
the behavior of people with different risk attitudes. For instance, what is the optimal 
amount of risk for a risk averse agent when building his portfolio? Shall a very risk 
averse person avoid all forms of risk? Contrarily to intuition, Arrow (1965) answers 
negatively whatever the riskiness and the risk aversion. Indeed, everyone is risk neutral 
with respect to very small changes and the only thing that matters in that case is the 
expected return. Hence, anyone will invest at least a small amount into a risky asset 
which is, on average, more profitable than a risk-less asset such as a treasury bond. 

To prove formally this claim, consider investing 1€ into a combination of a risky asset 
whose return is the random variable f and the risk-free asset whose sure return is r . 
We ought to show that the optimal share A of risky asset is positive. Observe that the 
final wealth of the individual is w = (1 - A)(l + r ) + A(l + f) = 1 + r + A(f - r ). Letting H 
denote the law of f, the expected utility is 

UW = E[u(iu)] = I u(l + r Q + A(r-r ))dH(r) 

The FOC of maximization is 

= f/'(A) = f (r - r ) u' (1 + r + A (r - r )) (r) dH{r) 

and since U'{0) = u'[\ + r )J(r - ro)dH{r) = u'[\ + r ) (E[f] - r ), there is an incentive to buy 
some of the risky asset (A > 0) as soon as the risky asset is on average more profitable 
than the risk-less one. 

Arrow (1965) also demonstrates that a less risk averse person invests more into the 
risky asset (cf. proof). Intuition would suggest that richer people take on more risk; this 



is correct for an agent with decreasing absolute risk aversion (DARA) since becoming 
richer turns him into a less risk averse person (and we can apply the previous result). 

Mixing Action and Chance 

When an action deterministically generates an outcome, the simple observation of the 
later enables to infer with exactitude the former. An example is cultivation inside a 
greenhouse: yield q is directly related to the effort e expanded by the farmer by a relation 
such a.s q = V3e. Upon observing q, we infer that effort was e = q 2 /3. 

Outdoor cultivation, on the other hand, is subject to weather variations so that a high 
yield can either reflect fair conditions or hard work and similarly, a poor yield can either 
reflect laziness or the losses due to a storm. Generally speaking, the action (input) is 
inaccurately reflected by the result (output). A simple formalization would be q = e + z 
where e is the effort, amount of daily time spend on the field while the random variable 
z captures the effect of weather variability; as a consequence, yield is also a random 
variable q. 

To capture the interaction of effort and chance, we write H{q\e) = Pr{q < q\e) the 
probability to observe a result lesser than q given that action e was taken. The expected 
value of q conditional on the action e is then written E [q | e] = f qh{q | e) dq where h{. | e) is 
the density associated to the distribution H{. \ e). Likewise the expectation of the random 
variable f{q) is denoted E [f{q) I e] . 

The previous modeling was adapted to moral hazard where the action of the decision 
maker, called the agent, is hidden to another party, called the principal. In problems of 
adverse selection, it is a piece of information known to the agent that is hidden from the 
principal. In that case, the notation is 6 instead of e. 

Informativeness 

In matters of asymmetric information, inference is fundamental and can be presented 
as follows: "given the observation of result x, what is the probability that the action 
undertaken was #?"; it will obviously depend on the initial belief we hold regarding the 
unknown action. 

Milgrom (1981) defines result x to be more informative than result y about a higher 
action if the observer believes higher actions more probable upon observing x than upon 
observing y. This notion of good news can be related to a property pervasive in infor- 
mation economics: the action change from 6 to 9 satisfies the monotone likelihood ratio 
property (MLRP) if 

h(x\d) 



The family of distributions h{.\6) is said to satisfy MLRP if (19.5) is true for all pa- 
rameter values. It is noticeable that MLRP is stronger than FSD i.e., > => H{.\6) < 
H{.\9) (cf proof). To understand the MLRP concept, imagine that the likelihood of ob- 
serving result y is the same after the two actions i.e., h{y\6) = h{y\6), then result x > y is 
more likely to appear after the higher action i.e., h{x\0) > h{x\6). 

I When a change towards a higher action satisfies the MLRP, greater outputs are signals 
of greater inputs (but not a proof). 

To apply the MLRP concept of informativeness, we define a function / to satisfy the 
single-crossing property (SCP) if it crosses the axis only once and from below i.e., 

3y,Vx,{x-y)f{x)>0 (19.6) 

A fundamental theorem follows: if the family of distributions h{.\9) satisfies the MLRP 
and / satisfies SCP, then (cf. proof) 

0>0=>E[/(jc)|0]>E[/(x)|0] (19.7) 

Another characterization (cf. proof)is that a change in distribution of the risky asset 
increases its demand whatever the risk aversion (u) and whatever the risk-free rate (r ) 
if and only if the change satisfies the MLRP. 



Chapter 20 



Moral Hazard 



One of the oldest contractual relationship in agriculture is tenancy. Under fixed rent, 
the farmer pays the landlord a monetary rent every year for using of land while under 
sharecropping he shares the crop with the landlord; alternatively, the farmer can become 
a laborer to earn a fixed wage. The latter formula works well when parties work in team 
but if the laborer is left without monitoring, he will shirk and the yield will be very low. 
At the other extreme, the fixed rent motivates the farmer to exploit optimally the fields 
because he gets to keep all the crop; the landlord can thus ask a high rent because on 
average the yield will be high. Everything's fine when the weather is good but if the 
winter is very cold or the summer very dry, the yield might severely drop and leave the 
farmer without enough seeds to replant, feed his family and pay the rent at the same 
time i.e, agriculture is a risky activity. This may well be the reason why some people 
prefer to be laborer than farmers. 

The landlord faces a trade-off when dealing with the farmer: incentives vs. insurance. 
Sharecropping therefore appears as a solution mixing both features. On the one hand, 
any additional effort expanded by the farmer will generate a higher yield (on average), 
half of which goes into his pocket; this is a good motivation for hard work. On the other 
hand, the farmer does not have to pay a monetary rent; he is less at risk in case of a bad 
harvest. The landlord is now sharing risk with the farmer. 

The plan of the chapter is as follows. We first detail the agency relationship, the basic 
incentive problem it faces and various remedies to it. We then use managerial incentives 
to introduce asymmetric information and the resulting inefficiency. The next section is 
more formal and emphasizes unexpected contingencies. The last section presents some 
extensions regarding the possible renegotiation of contracts. We use material borrowed 
from Rasmusen (2006), Macho-Stadler and Perez-Castrillo (1996) and Boccard (2002). 



20.1 The Agency Relationship 



Profit maximization calls for the pursuit of many goals such as cost minimization, quality 
enhancement or good customer relationships. Consider then a manager in charge of one 
such activity; the attainment of the goal requires work dedication, effort and a personal 
investment. To induce the manager to perform a significant effort, the owner can link a 
wage bonus to the attainment of an (explicit) objective. Now, to generate an additional 
improvement, an additional effort is required which can be implemented by increasing 
the bonus. 

20.1.1 Framework 

A fundamental assumption underlying the contract literature is that economic relations 
take place within a well defined and smoothly functioning legal framework. When eco- 
nomic agents sign a contract, they are bound to respect its terms whenever a court of law 
is able to understand them. This has two consequences; on the one hand, the contract 
can only include clauses and obligations that are verifiable by the judge or arbitrator but 
on the other hand, there cannot be any haggling over a verifiable item. 1@ 

Preferences 

The principal (e.g., landlord) hires an agent (e.g., farmer) to exert an effort e (e.g., daily 
hours of work). Applied into the production technology, this input gives rise to an output 
of value q{e) (e.g., crop) where q{.) is increasing and concave i.e., we assume decreasing 
returns to scale. The profit for the principal is n = q{e) - w where the wage w, paid to the 
agent, can be contingent on verifiable events. To simplify the analysis, we assume that 
the preferences of the agent for effort and income are separable with U{w,e) = u{w) - c{e) 
where u is increasing concave (decreasing returns to wealth) and c is increasing convex 
(increasing value of forgone leisure time). This specification has the great advantage that 
money can be transferred between the principal and the agent to reach any participation 
constraint. We normalize the utility function so that u{0) - c(0) = 0. 

Verifiable effort 

When the effort can be contracted upon, a simple contract is [e, w) meaning "do at least e 
and you'll get w (otherwise nothing)". Recalling, that the moral hazard situation refers to 
the fact that effort is undertaken by the agent, the latter can either choose some e > e and 
derive utility u{w) - c{e) or choose some e < e and derive utility u(0) - c{e). His best options 
under the two broad alternatives are thus e and 0, as he dislikes effort. The principal will 



succeed with his objective of having the agent expand effort e if u{w) - c{e) > u{0) - c(0) = 0. 
A first result is thus the obvious observation that the salary must compensate the agent 
for the toil of working. 

Individual Rationality 

To simplify matters we assume that a single principal is facing a multitude of potential 
agents to whom she makes a "take-it-or-leave-it" offer. This extreme formulation is jus- 
tified by the excess demand for the position that turns potential agents into Bertrand 
competitors ready to accept no more than their opportunity cost u to get the job; one can 
think of u as a minimum wage below which agents prefer to stay at home. This setting 
apparently opposes an all-mighty capitalist to an harmless worker; this is only done to 
ease the mathematical analysis. As we shall see later on, the opportunity cost acts as a 
slider that enable to share the benefits of the relationship in any proportion between the 
two parties (cf §20.1.2 on bargaining). Hence, when u is large, it is the agent who has 
most bargaining power. 

To convince the agent to sign the contract in the first place, the principal must offer 
him an expected utility at least as great as his opportunity cost u. This individual 
rationality (IR) condition is in fact a participation constraint: 

u{w) -c(e) > uo w > w{e) = u 1 [u + c{e)) (20.1) 

Since u and c are increasing, so is w, meaning that the higher the effort one wishes to 
implement, the higher must be the compensation. 

First Best 

The principal can now maximize her objective n = q{e)-w over contracts {e, w) satisfying 
the participation constraint (IR). Since she likes money too, it is optimal for her to satu- 
rate the participation constraint (20.1) by restricting attention to contracts [e, w{e)) i.e., 
pay the minimum acceptable wage. The principal's program thus becomes 

maxq(e)-w(e). (20.2) 

e — 



The FOC for (20.2) is 

q'=u/ = ^oc' = q'u' (20.3) 

i.e., the marginal utility of taking one minute of rest d equals the marginal value of 
money u' times the additional production q' of one additional minute of effort. Since q 



and u are both concave, the RHS of (20.3) is decreasing while cbeing convex implies that 
the LHS is increasing, there is thus a unique solution e* called the first-best effort. The 
first-best contract is [e*,w{e*)) and yields the final profit n* = q[e*)-w[e*). 

20.1.2 Moral Hazard 

Most often, effort is too complex to be monitored closely 2 @ but there does exists a minimal 
effort e that can be required from an employee i.e., failure to perform at this level is a 
juridically acceptable reason for firing him without salary 3 ® In such a situation, the 
wage scheme becomes flat i.e., the agent is paid a salary wASince his utility is u{w) - 
c{e), he has no incentive to work harder than the contractual minimum, thus he will 
perform e. The minimal wage satisfying the participation constraint is then w{e) and the 
principal earns n = q{e)-w{e). The literature often assumes e = to simplify matters but 
it is important to remember that the principal has also an opportunity cost of entering 
into the relationship with the agent which plays a symmetrical role to u. 

The very idea that moral hazard is an issue translates into e < e* , the fact that the 
principal would like the agent to perform more than the minimally contractible effort. 
This also implies that n<n* (recall that the profit q{e) -w{e) is concave with a maximum 
ate*). 

Verifiable Output 

Although effort is not verifiable, it can be the case that the output produced can be 
contracted upon. Since there is a one-to-one relationship between effort and production, 
any effort e is uniquely associated to the output q = q{e) and to the compensation w = 
w[e). Let us then consider the following contract "produce at least q and i'll pay you 
w+ 1€, otherwise nothing". It is readily observed that upon accepting this offer, the agent 
chooses to perform exactly e. As he obtains a utility level u [w[e) + l)-c(e) > u [w[e))-c{e) = 
u, he will accept the contract in the first place. Hence the principal can finely tune the 
contract (reduce the l€ bonus) to leave no more than u to the agent. Choosing e = e* 
implies that the first-best can be achieved although effort is not contractible. 

The previous scheme is however quite sensitive to the precise observation of q{e) and 
would fail to work properly if risk was taken into account. A standard piece rate scheme 
can achieve the same outcome; it consist of a base salary w and a piece rate factor ft so 
that w{q) = w + ftq. Upon signing such a contract, the agent has utility u[w + /3q{e)) - c{e) 
and is thus lead to expand the first best effort e* whenever the principal sets ft = 1 i.e., 
makes the agent the residual claimant of the activity. The base salary is then set so as to 
satisfy the participation constraint i.e., u[w + q{e*j) = u + c{e*) o w = w[e*) - q{e*) = -n*. 



Franchising 

Up to a slight change in the sequentiality of payments, the previous scheme is identical 
to franchising or "selling the store". Here, the agent pays the principal n* up-front (or its 
expectation if there is uncertainty) and then becomes residual claimant of the activity. 
Although effort may not be contractible, the agent's wage is now w{e) = -n* + q{e) so that 
his utility is u{q(e)-n*) - c{e). The optimal effort solves the first-best FOC (20.3) and 
since q(e) -n* = w[e) at e* , the solution is the first-best effort e*. The obstacle to such 
a clever method is limited liability i.e., in most cases where moral hazard matters, the 
agent is poor compared to the principal and cannot pay upfront the potentially large 
amount n* . 

Bargaining power 

It is realistic to assume that in their first encounter it is the principal who makes a "take- 
it-or-leave-it" offer to the agent and not the reverse. Once the agent has been working 
for her a while, he has acquired a valuable human capital for the firm and thus may be 
able to dictate his conditions for the next period. 

In that case, the agent will not buy the store but offer the principal the contract [e, w) 
maximizing his own utility u{w)-c{e) under the constraint that the principal accepts the 
offer. Turning down the agent, the principal could do the job herself or hire a new agent 
but in any case she would miss the participation of the old and more experienced agent 
and she would end up earning a lower profit n_ < n* ; this level n therefore represents her 
opportunity cost. The offer of the agent must satisfy q{e) - w > n for it to be accepted by 
the principal (participation constraint). Like the principal, he likes money, thus proposes 
the minimally acceptable offer, that saturating her participation constraint. 

The agent's utility is thus u{q{e)-n) - c{e) and his optimal effort is again efficient. 4 ® 
Under this scheme, the agent reaches a utility level of u* > u since all gains of the eco- 
nomic activity are passed from the principal to him (by duality n<n* o u < u*). Hence, 
by sliding the reservation utility from u to u* , we are able to implement any surplus dis- 
tribution between the principal and the agent (cf §2.4.3 on bargaining). The "take-it-or- 
leave-it" hypothesis is thus best seen as a tool to simplify the analysis without restricting 
its generality. 

Risk and Uncertainty 

In most basic moral hazard settings, the agent applies effort on some project or activity 
which later on yields a result or profit. It is clear that many unpredictable events like 
the weather or macro-economic shocks interfere with the work of the agent to increase 



or decrease the magnitude of his production, to change a winning project into a losing 
one. Hence the one-to-one relationship between effort and output is lost. This means 
that incentives based on output force the agent to bear risk: although he worked hard, 
the resulting performance might be adverse so that pay might be lower than expected. 
Whenever the agent is risk-averse, the use of incentives becomes more costly to the 
principal because she must compensate the agent for his risk bearing. 

20.2 Managerial Incentives 

Before presenting the general model of moral hazard, we develop here a simple model 
that captures the issues of risk and incentives. We show how contractual terms are 
optimally distorted towards lower effort to provide the agent with some insurance. The 
resulting optimal contract is called second-best because the associated profit falls short 
of the first-best level, the difference is called the agency cost of moral hazard. We then 
turn to a series of extensions that account for realistic features absent from the base 
model. 

We first derive the optimal incentive scheme as a decreasing function of the uncer- 
tainty surrounding the agent and his degree of risk aversion. Next, we show that if the 
observable performance measure on which incentives can be build is loosely aligned with 
the principal's real objective then incentives are watered down to impeach the agent to 
"game" the scheme. Next we analyze succinctly relative performance measure and show 
that evaluation against peers, although fundamentally a worse instrument than piece 
rates, does eliminate firm specific risks unlike absolute performance measures; for that 
reason, "employee of the month" and promotions can become optimal incentive schemes. 
The next step is job design and whether people ought to work alone or in team, so to say. 
We show that team work dominates individual accountability when uncertainty is not 
so much of an issue and technological complexity makes it difficult to trace specific effort 
from outcomes. Lastly, we deal with implicit incentives or career concerns, the fact that 
one works harder at the beginning of his professional life to signal high ability and enjoy 
later on a better position or earnings. 

20.2.1 Individual Compensation 

To simplify our study of uncertainty we make a number of simplifying assumptions in 
this section. The equivalent monetary cost of effort is quadratic and after normalizing 
adequately the measure of a unitary effort, becomes c{e) - \e 2 . The manager has a zero 
opportunity cost (u = 0) and constant risk aversion p; as we saw in §19.2.2 (eq. 19.2), if he 



earns a random normally distributed income x, he maximizes u = E[x] - f V [x]. We posit 
a constant productivity y of labour and an additive noise, blurring the relation between 
input and output i.e., q = ye + e. We further assume that the noise e follows a centered 
normal law with variance a 2 . To conclude, the contractible performance measure is an 
imperfect signal of effort BUT, in this section, the risk or randomness is independent of 
effort. 

The wage scheme used by the owner consists of piece rate or bonus factor ft and a base 
salary w, so that total (random) wage is w = w + fiiye + e) leading to an expected utility of 2 

u(e) =E[W]- \pV[w] - \e 2 = w + Pye-\e 2 - \pa 2 p 2 y 2 (20.4) 

Observe that a greater bonus factor /3 increases the manager's risk exposure; the last 
term in (20.4) is the corresponding risk premium which is independent of effort due to the 
assumed additivity in the performance measure. The optimal level of activity maximizes 
u by equating marginal wage /3 to marginal cost of effort i.e., e{p) = fly. As intuition would 
suggests, effort increases with the bonus factor and productivity. Plugging the optimal 
activity into (20.4), the agent's expected utility simplifies into u{e) = w- \f3 2 y 2 {po 2 - 1). 
The agent is willing to accept the wage scheme if this is positive thus the principal sets 
w{/5) = \fi 2 y 2 {po 2 - 1) so as to saturates this participation constraint. 

Summarizing, a bonus p motivates the manager to expand effort e = /3y at expected 
cost 

Me) = w{p) + Pye(P) = |(1 + pa 2 )/5 2 y 2 = |(1 + pa 2 )e 2 (20.5) 

We can now solve for the optimal contract of the firm. Since expected output Q = E[q] 
is linear in effort with Q = ye, the cost function is C{Q) = +P 2 Q 2 - Profit 5@ n = Q- C{Q) is 

2 2 

thus maximized for Q = - r ? , yielding a maximum ft - t^t- — ?r. 

To relate these findings to the first-best seen in the previous section, recall that the 
true cost of effort to the manager is c{e) = \e 2 < w{e). There is equality with (20.5) if 
either a = (no risk) or p = (risk-neutrality). The objective at the first-best is n* = y 2 l2 
so that the agency cost of moral hazard, expressed in percentage, is ^^ = ^ g 2 . Notice 
that the quantity objective might be replaced by a quality one or a reduction of marginal 
cost. In all cases, the presence of moral hazard generates an additional diseconomy of 
scale since the marginal cost of reaching the objective is an increasing function of the 
objective level. 



20.2.2 Misaligned Incentives 

The ultimate objective of the firm is profit but since no single person makes a definite 
impact upon it, the yearly profit is a poor signal of employees' past efforts; it is therefore 
not a good instrument to use in order to motivate hard work. The numerous elements 
that concur to profit are the quality sales and cost of products but also human capital ac- 
quisition or cooperativeness among employees. Most of these dimensions are too complex 
to be contracted upon, so that workable objective performance measures only display a 
small positive correlation with the objective that the firm may desire for a particular 
employee. We shall see, within the previous model, that explicit incentives are optimally 
low powered to avoid distorting the agent's effort towards unnecessary goals. More gen- 
erally, multiple instruments such as yearly cash bonus, promotion or job design enable 
to span better the fundamental objectives of the firm and avoid misaligned incentives. 

Suppose that the valuable output is q = e + e (assuming unitary productivity) but that 
the observable contractible performance measure is x - de + e where 6 is an index of the 
divergence between the principal and the agent objectives. Prior to the relationship, this 
parameter is unknown to all; we assume variance a 2 g and a unitary mean so that, on 
average, the performance measure is well designed. Once the agent starts to work for 
the principal, he becomes knowledgeable about the firm's technology and thus learns 
the true realized value of 6 i.e., whether his effort tends to over or under emphasize the 
observable performance measure (with respect to its real impact on profits). 

Based on our previous model, it is immediate to see that a bonus /3 based on x leads 
the informed employee to choose effort e = f)6 and expect total utility u{e) = w- \fi 2 {po 2 - 
d 2 ). The base salary guaranteeing ex-ante that the agent is willing to sign is thus w = 
E [\p 2 {pa 2 - 6 2 )] = \p 2 {po 2 -a\ - 1) as E[6 2 ] = ct 2 + E[6] 2 (recall that ex-ante the firm ignores 
the value of 6). Since expected effort is E[e] = ft, the expected cost for the principal is then 
an extension of (20.5): 

Me) = w + pE[x] = w + /3E[0e] = w + /3 2 E [6 2 ] = - (l + per 2 + a 2 e ) e 2 (20.6) 

Thus, even for a risk neutral agent (p = 0), the principal's adjusted cost of effort is in- 
creased by the agent's ability to exploit his insider information at her expense (a be- 
havior known as "gaming"). Whatever the value of the output, optimal incentives will 
be deliberately muted and optimal effort will be reduced further (as compared to the 
previous analysis since marginal cost is now higher). 



20.2.3 Rank-Order Tournaments f 

Instead of relying on an absolute performance to reward an agent, one can use a relative 
performance by comparing an agent against a yardstick such as the previous year sales 
or against peers both within and outside the firm. Rank order refers to the fact that 
the margin of winning does not affect the level of compensation. In many instances, 
it is easier to directly compare two outcomes (make an ordinal ranking) rather than 
assess them against an exogenous scale (make a cardinal ranking). There is thus a 
metering advantage (lower cost) of relative performance from being an ordinal measure 
as opposed to piece rate which is a cardinal measure. Relative measures are also more 
flexible in the sense that if the environment changes e.g., technology improves, then 
all the peers are likely to be affected in the same manner so that incentives remain 
unaffected quite differently from the case of a piece rate scheme whose absolute bonus 
become automatically distorted. 

Following Lazear and Rosen (1981), we analyze the simple setting where two agents 
with current salary w compete for a higher rank position paying a bonus p\ The first 
agent's wage is then the random W\ = w + p / i qi>q2 which is a binomial variable. The 
probability of winning is E [q x > q 2 ] = t[li-e 2 > e 2 -e\] = H{e\ - e 2 ) where H is the law of a 
centered normal variable with variance 2a 2 . The expected wage is thus w + (5H{e\ - e 2 ) 
while its variance is fi 2 H{e\ - e 2 ){\ - H{e\ - e 2 )) which, as a function of the bonus, is less 
risky than a piece rate scheme. As before, each agent has constant risk aversion p, thus 
maximizes 

u{ei) = w + /3H{e l - e 2 )-2 P p 2 H{e 1 - e 2 )(l - H{e x - e 2 )) - \e\ 

The FOC of optimal effort is p"fa(<?i - e 2 ) - e x + h{e\ - <? 2 )(1 - 2H{e x - e 2 )). Since all the 
setting is symmetric, so is the equilibrium, hence the FOC reads e\ = e\ - fih{0) as H{0) = 
\. Recalling that H is the law of l\ -e 2 , one can compute h{0) = j^T- Hence, the greater 
the noisiness of the performance measure, the lower is the induced effort. To induce high 
effort, it is thus necessary to set a large bonus and force the agent to support a greater 
risk. As we shall now see, tournament ends up being more costly. 

The salary is set to solve the agent participation constraint u{e*) = i.e., w = -\fi + 
\p(!> 2 + \{e*) 2 . The total cost of achieving effort level e per agent is thus c{e) = w + \fi = 

lPmP + ^ e2 = ^ l + 4npa2)e2 as W = 47l(j2 - 

Evaluation against a yardstick q is a form of contest quite similar to the rank order 

tournament yielding a twice lower risk premium. 6@ Comparing the reward schemes is 

now easy in our setting since we only need looking at the risk premium factor in each 

cost formula. Since 4n « 12.6, the risk premium is greatest under tournament, then twice 

lower under yardstick and lowest under piece rate. 7@ As soon as uncertainty matters, the 



tournament (relative evaluation) is the worse method of providing incentive, followed by 
yardstick while the direct evaluation enabled by piece rate is best. Note though that 
all are inefficient since moral hazard cannot be eliminated as it must be traded against 
insurance. 

However, if the noise blurring the observation of a worker's effort is made of a firm 
specific component plus an idiosyncratic component i.e., q\ = ei + rj + ei then the vari- 
ance factor in the piece rate cost is a = a 2 + a 2 while there is no change in the case of 
a tournament because the firm specific components cancel out. It is now clear that the 
tournament is optimal whenever the firm specific noise is greater than the idiosyncratic 
one i.e., a 2 > {4n - l)a 2 e . Notice that the yardstick evaluation is not immune against firm 
shocks. This results rationalizes the use of tournaments for managers because their 
effort relate to firm wide strategies who are particularly sensitive to external shocks. 
Workers at lower levels of the hierarchy operate in an environment relatively safe from 
external influences so that piece rate schemes are optimal for them. Lastly, our previous 
observation that tournaments generate more income variance than piece rate schemes 
(na 2 e 2 vs. a 2 e 2 ), means that "risk averters" prefer piece rate schemes while "risk-lovers" 
prefer tournaments. This rationalizes the fact that "risk averters" occupy lower levels 
of the hierarchy and are paid according to their productivity (with a piece rate scheme) 
while "risk-lovers" choose risky occupations in which few win very large prizes. 

20.2.4 Multi-Tasking 

Corts (2007) studies team vs individual compensation when agents must perform several 
tasks (cf §13.1.2). We consider two functions that apply to two product lines a and (5; 
together they generate four activities. At a given level of employment, say two managers, 
an assignment scheme must be devised to perform all four activities. Assuming high 
returns from specialization, each of the two agents performs two tasks. 

The effort level in any activity has a cost c{e) = \e 2 and yield a one-to-one monetary 
return for the owner i.e., n = e\ + e 2 + e 3 + e 4 . The performance measure of the two products 
are q a = y l ei+ y 2 e 2 + e a and qp = y 3 e 3 + y 4 e 4 + ep where e a and ep are independent noises 
blurring the observations. We cannot normalize the productivities to unity as before 
unless they are identical, thus we shall later on assume j x = y 3 = 1 and y 2 = y 4 = y > 1 
where the latter parameter measures the severity of the multi-task problem. As usual, 
the wage scheme includes a base salary and bonuses related to the observable outputs 
i.e., w = w + aq a + fiqp. As before, the salary w will be tuned to meet the participation 
constraint. Assuming that the managers are risk averse with (identical) coefficient of 
absolute risk aversion p, their gross expected utility is u{e) = w- \po 2 [a 2 + /3 2 ) + a(y : ei + 



y 2 02) + P (73^3 +74^4) which is an immediate extension of (20.4) except for the omission of 
effort costs. 

If an agent works in a team, he must provide the pair (01,03) or (e 2 ,e 4 ) 8@ while if 
he works alone on a product line, he must provide the pair (01,02) or (03,04). In team 
work, the optimal level of activity 01 maximizes ay x e x - \e\, thus he chooses e\ = ay x and 
by symmetry 03 = /3y 3 . Efficiency commands a unit effort in each activity; this can be 
achieved by tuning the appropriate bonus factor for that agent, whatever the produc- 
tivity 7 may be in each activity. This is the advantage of team work as we shall see 
right-away Since the owner must compensate the agent for the risk burden, her ex- 
pected cost is \ (l + pa 2 ly\) e\ as in (20.5) up to the productivity factor y x . The profit over 
the first activity is thus 

Tt\ = 01 - - (l + pa 2 ly\) e\ = a-Yi - -{pa 2 + y 2 x )a 2 

so that the optimal bonus factor is a = — p— ? and the maximum profit reduces to n x - 

2 2 

\ — p— 2. By symmetry for the third activity, n 3 = \ — p— j. The case of the other agent is 

identical. Using the normalization of the productivities and summing the four activities, 



r 



.2 



we obtain Ilteam = — ttt h > 

Let us study now individual line work. The a product manager chooses effort 01 to 
maximize ay^i - \e\, hence e x = aj x . Observe now that his optimal choice for the other 
task is 2 = aj 2 because his pay depends only on the a output. 9@ Incentives are thus 
worsely aligned but there is also less risk exposure since /3 = for that the a product 
manager. The profit over product a is thus 

n a = 01 + 2 - I {e\ + e\) - \pa 2 a 2 = a{y x + y 2 ) - \ {pa 2 + rl + y 2 2 )a 2 

so that the optimal parameter is a = — r 2 1+ l 2 2 and the maximum profit reduces to n a = 

\ y 2 1+r 2 2 2 - Using the normalization of the productivities and the perfect symmetry for 

the other product, we obtain total profit n inc j = ^ 2 + J x+ 2 - One can show that 10@ nt ea m > 
Hind if 7 > 3 + 4pcr 2 . We can conclude: 

When the degree of risk aversion is low, or when signals of effort are informative or 
when the multi-task problem is serious, team work dominates individual accountabil- 
ity. 

In terms of firm organization, this suggest reasons when functional rather and divi- 
sional structure is preferred (cf §13.1.4). 



20.2.5 Career Concerns 

A manager can be motivated by explicit incentives such as performance based bonuses 
or promotions, but also by the implicit incentives channeled through the labor market. If 
innate ability which is highly variable, was easily observed (could be proved), managers 
would always be paid according to their productivity. Education is one way to signal 
this personal characteristic because of the differential cost to acquire human capital (cf 
§21.1.3 on signaling). However people change jobs several times (on average) during 
their first decade of professional life, thus the market valuation of their ability comes to 
dominate that incorporated in the diploma. The fact that the market is able to infer a 
good estimate of real productivity out of observable performances gives rise to a different 
kind of signaling: people work hard as juniors in order to achieve great performance, 
signal their worth and ultimately achieve better senior positions. The incentive at work 
here is implicit because the market does not conscientiously design the inference process 
revealing abilities and it is also dynamic as future pay is based on past performance. 
We follow here the seminal contribution of Holmstrom (1982b). On top of the moral 
hazard seen before (effort is delegated), the fact that ability is a private information to 
the worker introduces an adverse selection dimension to the problem. 

Abilities in the population are measured by a zero mean index so as to capture su- 
perior or inferior individual ability. The agent whose ability is 9 expands effort e in his 
current occupation. We keep referring to the market or his future employer as the prin- 
cipal. The observable performance measure is q = 9 + e + e where e is a white noise as 
before. From the point of view of the principal, the ability is a centered normal variable 

a 2 

with variance a 2 B . Let us denote t = 2 ° 2 < 1, a (positive) measure of the informativeness 

8 

of the signal q upon the ability 9. Upon observing an output q and anticipating an effort 
level e, the principal derives a realization of the random variable 9 + e, he thus infers a 
new estimate of the underlying ability E[0|g,e] -x{q-e) computed using the statistical 
laws of the involved random variables. 

Ex-ante, during the junior part of his working life, the agent with ability 9 must 
choose how much effort to invest in the signaling activity; he knows that his ex-post 
wage, as a senior manager, will depend on the effort e anticipated by the principal and 
on the noise e (trough the performance q), his expected wage is thus E £ [w] = E e [E [9\q, e\] = 
r{9 + e-e). Given the quadratic cost of effort \e 2 , the optimal effort is e* = r which 
is increasing in the market's ability to infer productivity out of the observable perfor- 
mance. 11 ® At the limit where ability is fully observable i.e., r = 1, the effort is efficient 
since welfare is here q - c{e). 

Since ability is unknown to the market, there are returns to effort because perfor- 



mance influences the ability perception. The agent thus tries to bias the process of in- 
ference in his favor. However, in equilibrium the market anticipates the effort level and 
adjusts the output measure accordingly i.e., no one can fool the market. Yet, the agent 
is trapped in supplying the equilibrium level that is expected of him, because, as in a rat 
race, a lower supply of labour will bias the evaluation procedure against him. Including 
many periods of repeated effort and performance reveals that effort gradually decreases 
as the end of the professional life approaches because the market inference has been 
refined to the point of identifying exactly the ability i.e., there is nothing left to signal 
which means that effort becomes useless. 

20.3 State of Nature Approach 

In this section, the valuable output is q = O(e,0) where 6 is a choice of "Nature" such as a 
macro-economic shock. In the case of automobile insurance, 6 can take one of two values, 
"accident" or "none" and e is the cautiousness taken by the driver (e.g., respect speed 
limits). The presence of uncertainty does not by itself ruin our hopes to get insurance. 
If the insurer could install a black box registering every driving decisions, then effort 
(driving care) would be contractible because it could be verified after the occurrence of 
an accident; the first-best would be achieved. What is reality for airplanes and trucks is 
not yet implemented for individual cars so that moral hazard is an issue. Another basic 
example is when the agent is the manager of a firm. The parameter 6 can then reflect 
the strength of demand for the good he is in charge of developing and producing and e 
is the amount of time dedicated to this project. 12@ It is quite clear that the overall profit 
of the firm will depend on both the market demand and the care with which the agent 
managed the project; the interconnection between the two components is so complex that 
there is no way to decipher the whole sequence of actions that the agent took to judge 
whether he worked hard or not. This means that effort is unverifiable and since it cannot 
be inferred from the output, it becomes a free decision for the agent. To repeat ourselves, 
the asymmetry of information in settings of moral hazard forces the principal to delegate 
crucial decisions to the agent. 

20.3.1 The Second Best program 

As we just argued, in the majority of real cases it is too costly to perform an audit after 
production has occurred to discover how much the agent has worked (and pay him ac- 
cordingly). Hence, the principal can only propose a wage contingent on the production 
level, knowing that the agent will choose its effort to maximize his expected utility (over 



the distribution of the state of nature). Our task in this section is to characterize the 
optimal such contract; Figure 20.1 is the game tree describing the relationship between 
the principal and the agent. The players are the principal P, the agent A and a special 
player which is not strategic, nature N. The first player to move is the principal who 
offers a contract out of a large range of possibilities (represented by the cone); it is a 
rule w{q) stipulating a wage for each level of output. Then the agent either refuses or 
accepts the contract and immediately after he chooses his effort (in our simple sketch he 
has only two choices, high and low). Later on Nature decides whether the state of the 
world is favorable or not which gives us the final output q = O(e,0) jointly determined by 
his effort and the state of nature nature. 
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Figure 20.1: Principal -Agent Contractual Relationship 

The Principal-Agent game of Figure 20.1 is solved by backward induction, a concept 
seen in §2.4.2. The agent's ex-post utility (at stage 5) is u[w{q)) - c{e) where q = O(e,0), 
hence the agent's expected utility at the time where he has to choose his effort (stage 
3) is U{w,e) = Eg [u{w{<&{e,0)))} - c{e). Being rational, the agent will pick the effort e{w) 
maximizing v{w,e) over the available choices. Moving one step backward (stage 2), the 
rational agent will accept the contract w{q) if and only if it guarantees him more than its 
outside option i.e., U{w, e{w)) > u. Provided that w is attractive enough to be accepted by 
the agent, the principal's ex-ante profit (at stage 1) is U{w) = Eg [<${e{w),0) - w{<&(e{w),0))}. 
This cumbersome formulation is presented as the following optimization problem: 13@ 

f U(w,e)>u (IR) 

maxE e [®(e,6)-w(®(e,6))] s.t.^ - (20.7) 

wi.) { Ve?e,U(w,e)>U(w,e) (IQ 

where {IQ is the incentive compatibility constraint stating that the agent always chooses 
the effort best for him given the contract w (.) he signed and {IR) is the individual ratio- 



nality constraint that guarantees participation. 

Since this program is very difficult to solve we study a simpler version and assume 
that the agent manages a risky project yielding either success (value 1€) or failure (value 
0€). A contract is now a pair {wo,p where /3 is the bonus for good results (e.g., stock 
options for managers). The effort positively influences the probability of success but 
with decreasing return to scale i.e., the probability q{e) is concave increasing. We assume 
separability of effort and income with u{e, w ) = u{w )-c{e) where u is concave increasing 
(monetary risk aversion) and c is convex increasing. 

20.3.2 Resolution 

Let us first find out which contracts induce the agent to take some specific effort e. The 
expected utility of the agent is (1 - q(e))u{wo) + q{e)u(wo + p)- c(e) hence the optimal ef- 
fort solves u(wo + p) - u[wo) = ^M. As c" > 0, q" < and u' > 0, there exists fi{wo,e) such 
that u[wo + f}(Wo,e)) - u{w ) = ^M. Independently of risk aversion, the greater the effort 
wanted by the principal, the greater the bonus he has to give i.e., -J^ > 0. Among the 
contracts implementing e, the least costly solves 

c'(e) 

u + c{e) = {l-q{e))u{wo) + q{e)u{w + p) = u{w ) + q{e) 

q'{e) 

=> u>q= w(e) = u' 1 \u+c{e) < w{e) = u' 1 [u + c[e)) (20.8) 

where w' > and w" > (composition of two convex functions). The difference with w{.) 
characterizing the participation constraint without moral hazard (cf eq. (20.1)) is the 
fraction term. The profit for the principal when implementing e is then 

Yl(e) = a-q{e)){0-w{e)) + q{e)[l-w{e)-p{w{e),e)) 

= q(e)-w(e)-q(e)p(w(e),e) (20.9) 

In the first-best regime where effort is contractible, the profit with a contract [e,w{e)) 
is q{e) - w{e). The difference in the presence of moral hazard is twofold: one the one 
hand, there is the costlier bonus /3 [w{.e], e) that must be given to the agent to induce him 
to perform e while on the other hand, the base salary w is cheaper. The last step for 
the resolution of (20.7) is to choose an optimal e to maximize (20.9). The optimal effort 
solving program (20.7), known as the second best effort, is smaller than the first-best one 
and we may conclude that the presence of moral hazard forces the principal to distort 
the optimal contract towards less effort since incentives are now more costly to provide. 



If the agent is risk-neutral, the bonus implementing an effort e is independent of the 
salary, it is /3(e) = ^M while its associated salary is w{e) - u + c(e) - ^fep • In that case, 
the principal's payoff is 11(e) = q{e){\ - p) - w{e) = q{e) - c(e) -u which is the first best. 



Pay for outputs, not inputs 

Mirrlees (1974) shows the following paradox: if there existed an outcome q that would 
occur with zero probability after the first-best effort e* but would occur with positive 
probability for every lower effort then the principal would successfully offer the agent the 
following contract "I pay you w{e*) + 1€ but if q appears you will be executed". Backward 
rationality tells us that e* is optimal under acceptance of the contract and that accepting 
is optimal in the first place. The purpose of this silly example is to show that rewards 
should not be linked to output but to input. Hence if a lazy effort increase the occurrence 
of some output then the contingent wage should be very low to deter laziness. Likewise 
if the optimal effort yield more often some output then the contingent wage should be 
very large to encourage this effort. 

20.3.3 The Mirrlees Approach f 

Holmstrom (1979) provides a powerful yet simple characterization of the optimal con- 
tract using the Mirrlees (1974) approach of turning random variables into distribu- 
tions admitting densities. The production is a random variable q whose law is H{e, q) = 
Pr[q<q\e) depends on the effort e previously chosen by the agent. The density is 
h[e, q) = dH Q q q) ; we denote h e = ^. 

From the agent's ex-post utility u[w) - c(e), we deduce the ex-ante utility conditional 
on the wage scheme w. 

U{w,e)= I u{(x){q))h{e,q)dq -c(e) 

Moral hazard means that the agents chooses effort e to maximize U{w, e) i.e., satisfies the 
incentive compatibility constraint (IC) 

c'(e) = I u {(x){q)) h e {e, q) dq (20.10) 

Furthermore, he will accept the wage scheme w only if U{a), e)>u (IR). 

Likewise the ex-post utility of the principal being n(q- w), she expects U{cd, e) = Jn{q- 
w{q))h{e,q)dq when the agent has accepted the scheme id. Her objective is thus to maxi- 
mize U{cd, e) under the above (IC) and (IR) conditions. The Lagragean (with non negative 



multipliers A and /i) is 

££ = \ n{q-(D{q))h[e, q)dq + X[U{co, e)- u} + p c'(e)- / u[o){q))h e {e,q)dq 

and can be maximized point-wise in the variable w i.e., by solving |=| = to obtain 

7i'{q — w) h e 

-n {q- w) + Xu {w)h{e, q) + uu {w)h e {e, q) =0o = X + u — {e, q) (20.11) 

u'{w) n 

A first best situation, one without moral hazard, is found by maximizing U{co, e) + 
XU{u), e) in w and e for some multiplier X (this is equivalent to maximize n under the (IR) 
constraint). The optimal risk sharing is found by a point-wise maximization as above; 
the FOC for every q is 

Tt'id — W) 

— - = X (20.12) 

u'{w) 

and the solution is an increasing 14@ function a)*(q). The efficient effort e* solves 15 ® 

c\e) = f luioiq)) + n ^-™WA he{et q) dq (20.13) 

To be able to compare the risk-sharing equations (20.11) and (20.12), Holmstrom 
(1979) first proves that fj, > 0; then using the fact that the RHS n ^ZT is increasing in w, 
he can deduce that a)* (.q)<6j{q) o h e ^.0. The meaning of this result is that higher effort is 
promoted: indeed, at all productions whose probability is increased by effort (h e > 0), the 
wage increase faster than the first best one while the reverse holds true at all productions 
whose probability is decreased by effort (h e < 0). If we further assume that the ratio ^ is 
increasing in q then the second best u){q) is increasing. 16 ® This motivation towards hard 
work is nevertheless countered by the need to share risk so that in the end the second 
best effort is inefficiently low. Indeed, comparing (20.10) and (20.13) reveals that e < e* 
if fn{q-to{q))h e {e, q)dq>0 which happens to be a consequence of yu>0. 17@ 

20.3.4 Automobile Insurance 

Since the second best program (20.7) is difficult to solve we present Rasmusen (2006)'s 
simple application to automobile theft insurance where the analytical solution can be 
derived. Assume that effort has two levels, care e and no-care e generating theft proba- 
bilities of \ and | respectively. The utility of the car is 12 and the owner is risk averse. 
An insurance contract C = ip,8) is a premium p and a reimbursement 5 of damages; the 
absence of contract is Co = (0,0). In the absence of an insurance contract the expected util- 



ity under care is u[Co,e) = ^p - c where c is the disutility of taking care (e.g., checking 
every night that doors are locked). Without care the utility is u{C , e) = =^. Assuming c 
not too large, the non-insured car owner will optimally take care. 

Observe that a contract can also be labeled C = {wnt,u>t) where wnt = 12- p and 
wj = S - p are the income level of the owner in the two possible states of nature (theft 
and no-theft). No insurance can thus be labeled C = (12,0). As the probability of theft 
is 50% under care, the owner is indifferent among C and C 4 = (0,12). On Figure 20.2 
the indifference curve joining Co to C4 passes below C\ = (6,6) because the owner is risk- 
averse. 



a Theft 






,^4 








Risk sharing 


no care 
s' care 


c 2 *' 

/ 


K X c 3 

1 1 


- : ^— > 



No theft 



6 7 



12 



Figure 20.2: Risk Sharing 



Competition among risk neutral insurers should drive them to offer the full insurance 
contract (p = 6,6 = 12) denoted Ci that leaves an insurer break even (provided that the 
policyholder takes care). The economic surplus S = u{6) - M(12) ~ M(0) goes to the car owner. 

Moral hazard appears immediately because the car owner has no more incentive to 
take care since u{C\,e) = u{6) - c < u{C\,e) = u(6): why take care if the car will be fully 
reimbursed whatever happens ? Given this change of behavior, the insurer now expects 
losses of 6- |l2 = -3 and will raise the premium from 6 to 9 (contract C 2 on Figure 20.2). 
Under this contract the utility of the owner is u{C2,e) = u{3). If this amount is less than 
u{Co, e) (as shown on Figure 20.2) then the owner won't buy insurance. 

Full insurance is thus incompatible with both the insurer IR constraint (offering a 
contract) and the owner IR constraint (agreeing to buy it). The optimal contract C* = 



(w* NT , w* T ) has to give an incentive to the owner to take care of its car i.e., must satisfy 

u{wnt) + u{wt) u(wnt)+3u{wt) 

c > 

2 4 

oAc<u{wnt)-u{wt) = u{\2 - p) - u{6 - p) (IC) 

i.e., the contract has to be below the bold dashed curve on Figure 20.2 known as the IC 
curve. 

Since insurer competition bestow all the surplus on the owner, the equilibrium con- 
tract will leave the insurer break even i.e., be the intersection C 3 of the IC curve and the 
insurer IR line joining C to C 4 . Let us assume to simplify exposition that the optimal 
reimbursement is 5* = 10. Then the break even premium is p* = 5 so that the income 
levels are w* NT = 7 and w* T = 5. The difference 12 - 5* = 2 is the deductible of the optimal 
insurance contract. The larger the deductible, the larger the risk supported by the pol- 
icyholder thus the greater the care he takes of the insured item. Full insurance is rare 
because 12 stands for the value of finding, buying and bringing home the item while a 
traditional "full insurance" contract only repays the market value. Hence the owner has 
always a little incentive to take care of his property. 

20.4 Renegotiation and Auditing 

In this last section, we treat additional topics relating to the renegotiation of contracts 
and the game of auditing/cheating between taxpayers and the IRS. 

Binary choice 

Let us introduce the idea of renegotiation and its aftermath by way of a simple example. 
An individual earns w by working the regular time but can obtain a bonus /3 by working 
extra-hours although this has an opportunity cost c> (5 to him so that his current dom- 
inant choice is to perform no extra hours. Let us now introduce into the picture a car 
of objective (market) value p and subjective value n to the individual with p <n<fi i.e., 
the car is more valuable than the market price p but lesser than the maximum salary. 
\fn-p>c-p+w i.e., it is efficient for the individual to own the car, then a bank can 
lend p to the agent to buy the car with the promise to repay p later on by working extra- 
hours. The reason why such a contract works is the following: by working hard, the 
agent makes enough money to repay the loan and enjoy the car, hence his final utility is 
n-p + fi-c> w. If on the contrary he decides to shirk then he won't be able to repay the 
loan, the bank will seize the car and his final utility will be w. The previous condition is 



the incentive compatibility one. 

If renegotiation is possible then when the bank faces a defaulter it cannot help but 
renegotiate. Indeed, the car being already used, it loses much of its market value thus it 
is better for the bank to accept the lower payment w from a shirker (instead of p) than 
seizing and reselling for even less. This totally changes the incentives for work since 
the utility of leisure is know w + n- w = n which can be greater than n- p + /3-c. In that 
case, the agent optimally chooses to shirk and in response the bank ceases to offer a 
loan knowing that it will make a loss p-w. There is a market failure generated by the 
inevitability of renegotiation. Banks are therefore forced to be tough (commit to seize no 
matter what) in order to be able to operate a Pareto improving intermediation service. 

Contractual renegotiation can be a problem because it is often unavoidable. If one 
party into an agency relation cannot engage into a long term relationship then contracts 
will be short-term and renewed frequently which is like the renegotiation of a long-term 
contract. This happens for regulators since the prices they impose on firms last no more 
than the regulatory period and there can be no insurance that the legislator or the gov- 
ernment will not change its policy the next time. Likewise, a sovereign state, unless tied 
by a treaty like the EU one or by membership of the WTO, will easily renege a contract 
with a firm, whether national or foreign. Beyond these cases of forced renegotiation, we 
find the voluntary ones: if the parties agree to tear- up the contract and write a new one, 
no one can stop them. 

Continuous choice 

Let us use our formal set-up to analyze the consequences of renegotiation. Once the 
agent has exerted the second best effort e, it not necessary anymore for the principal to 
use the costly bonus p(w{e),e). Indeed the bonus was offered to motivate the agent al- 
though it was forcing him to support unwanted risk; now that effort has been expanded, 
the bonus has played its role and should be removed before uncertainty resolves itself 
in order to fully insulate the agent from risk. Technically, the constant wage iu solving 
[l- q{e)) u[w{e)) + q{e)u[w{e) + /3) = u{w) is cheaper than the expected payment w{e) + q(e)/3 
stipulated in the original contract (this is so because u is concave). 

The principal just got trapped because the agent, anticipating this change to come, 
has no more incentive to produce the effort e; he is actually fully conscious that his 
final payoff will be constant although it appears at first sight to contain a bonus. The 
rational agent therefore chooses the minimum level e instead of e. This reasoning is 
true for any effort e > e because an incentive contract with a positive bonus /3{w ,e) is 
always renegotiated to a nil one (5 = which precludes (in equilibrium) the agent from 
performing e. 



To resolve this time-inconsistency, the principal must offer a menu of contracts con- 
tingent on effort that make the agent indifferent between all efforts in [<?; e] and, given 
the equilibrium distribution of effort a over [<?; e] , makes the principal indifferent be- 
tween renegotiating and not. We shall not delve into the computations but it should 
be clear that the average effort implement under this scheme will fall below the second 
best one e; thus we may say that committing to a single contract or committing to never 
renegotiate is helpful for the principal. In other words it pays "to be true to one's word". 

Notice lastly, that if the agent leads the renegotiation 18 ® and is rich enough to "buy 
the store" at price U{e), then renegotiation is not a problem anymore. Indeed the princi- 
pal will accept a change in the contract only if the agent offers more than Yl{e) which can 
be achieved only by expanding the second best effort e\ 

Debt renegotiation 

We show here that the seller of an item has an incentive to finance her activity through 
debt to dilute the bargaining power of the buyer. We consider in turn the two cases where 
debt is not renegotiable and renegotiable to show that the latter fosters the strategic 
purpose of debt. 

In most trading situation, the surplus generated by an exchange depends on many 
factors like future market demand or future price of oil not yet known when the parties 
devise their trade. From an ex-ante point of view the surplus is a random variable x e U+ 
whose distribution function is H{x) (h denotes the density). In this situation, the trading 
price has to be set trough bargaining once the surplus is known to the parties. The seller 
and the maker share it according to their respective bargaining abilities A and 1 - A. 

Whenever the surplus x is inferior to the debt service F, the seller is caught in the 
debt overhang problem thus she refuses to deal and closes her firm. As a consequence, 
the value of debt is only (1 - H{F))F. The value V of the firm for the seller is thus the 
value of her share 1 - A of profits x-F when she does not go bankrupt plus the funds 
raised from the debt emission i.e., 

/■oo /■oo 

V= \ (l-X)(x-F)dH(x) + (l-H(F))F=(l-X) I xdH(x) + A(l - H(F))F (20.14) 

Jf Jf 

The optimal level of debt F* solves -(l- A)h(F)F + X(l- H(F)) - Xh(F)F = <> X = -^f^ and 
is positive since the RHS is zero for F = 0. 

If debt has been issued by a single bank then the event of bankruptcy is not so sure 
because the bank might prefer to renegotiate the debt service F down to x. Credibly 
threatening not to trade, the seller gets all the surplus and use it integrally to repay the 
bank. The value of debt is now Vd - f™ FdH[x) + f xdH{x) so that the value of the firm 



for the maker becomes 



-< 



V = {l-X)\ (x-P)dH(jc) + Vb = (l-A)E[jc]+AVb (20.15) 



which is increasing function of F, hence the optimal level of debt is maximum. Renegoti- 
ation fosters the strategic purpose of debt which is to reduce the bargaining power of the 
buyer. This is so because the buyer loses all of his bargaining power when the firm is in 
danger of going bankrupt. 

Audit 

In all countries the tax authority (Internal Revenue Service in the US), is faced with 
"suspect" income declarations by tax payers. We can view the relation of a liable citizen 
(agent A) with the tax authority (principal P) as a game. Two forms can be thought of: 

► Simultaneity: the tax authority chooses to audit or trust a declaration while the citizen 
chooses to cheat or reveal its true income. 

► Sequential: the tax authority chooses to audit a proportion a of all declarations and 
then, knowing this policy, the citizen chooses to cheat or reveal its true income. 

In the first game, the strategy of the tax authority is the probability of auditing ap 
while the strategy of the citizen is the probability of cheating a A . Letting t denote the 
income tax, / the fine paid if caught cheating and c the unit cost of audit, payoffs are 

U A (a P ,a A ) = -a A a P {t + f)-(l-a A )t (20.16) 

Up(a P ,a A ) = ap{T-c + a A f) + (l-a P )(l-a A )t (20.17) 

There is no pure strategy equilibrium since a A = 1 (cheat) triggers ap = 1 (audit) itself 
triggering a A = (reveal). In that case there is no point to audit (ap = 0) and therefore 
the citizen is better off cheating (a A = 1). To solve this conundrum, we have to look for 
a mixed strategy equilibrium. The audit frequency a* p making the citizen indifferent 
between revealing and cheating solves 



dU A {a P ,a A ) 



= -a P {t + f) + t = 



a D = 
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and likewise 

dn P {ap,a A ) , . c 

= {t-c + a A f)-a-a A )t = => a A = 

oap t + j 

is the cheating frequency making the tax authority indifferent between auditing and 



trusting. The final payoffs are n^ = -t and U* p = t-jh as if the tax authority was always 
auditing and citizens would therefore always pay their taxes. Still in equilibrium some 
people cheat and some do not while some are audited and some are not. 

In the second game we are in fact assuming that the tax authority can credibly an- 
nounce that it will audit a% of all income declarations. Choosing a = ■£* + 1% forces each 
citizen to reveal because the probability of audit is now strictly superior to a* p . Final pay- 
offs are identical but no cheating takes place anymore thus the following year the tax 
authority will tend to (secretly) decrease a since it is a costly action but then citizens will 
anticipate this and start to cheat again !! 



Chapter 21 



Adverse Selection 



The private information held by an agent is crucial to determine the efficient decision 
in the agency relation. To make sure that this optimal action will be carried out, the 
principal must elicit this information. As intuition suggests, this can only be achieved 
by giving up an information rent to the agent (with respect to the complete information 
case). However this mechanically increases the marginal cost of production and creates 
a distortion or welfare loss. The principal must therefore arbitrate between productive 
efficiency and information acquisition. 

This chapter contains two section. We first model the unraveling and signaling of 
information in a market and then study various models of screening where firms design 
multiple contracts that are offered to all their potential clients to take advantage of the 
induced self-selection. 

21.1 Information Unraveling 

In this section, the item for sale has a quality only known to its producer (service) or 
current owner (good). In Akerlof (1970)'s model of partial equilibrium for second-hand 
goods, a market failure occurs that can even lead to complete market breakdown. When 
there is a single seller, Levin and Tadelis (2005) show how the firm adapts its internal 
organization to resolve the information revelation conundrum. In an application to the 
theory of the firm, it is shown that the partnership structure can dominate the corpora- 
tion structure if the quality of the service for sale is difficult to observe. 

In this section we shall get a better understanding of the law on currencies stating 
that "bad money drives out good" 1@ or Groucho Marx joking "I wouldn't want to belong 
to any club that is willing to give me membership" or businessmen saying that "trade is 
difficult in developing countries". 



21.1.1 Market for Used Cars 

Akerlof (1970) observes that in many markets, buyers use statistics to judge the quality 
of prospective purchases. In that case, there is an incentive for sellers to offer poor 
quality merchandise; indeed the returns for selling good quality items accrue to the 
entire group whose statistic is affected rather than to the individual seller. As a result 
of this free riding, there tends to be a reduction in the average quality of goods and also 
in the size of the market. 

A startling example is the large price difference between new cars and those which 
have just left the showroom. Automotive companies try their best to build perfectly 
good cars but there are always some copies that turn to be "lemons" (a colloquialism 
for defective used cars). Since the car is brand new, neither the buyer nor the dealer 
knows the exact quality of the car; there is symmetric information (or more precisely 
lack of). Things are different when you look at a used car because the current owner has 
probably learned whether her car was good or bad, she has acquired superior information 
by experiencing the car. There is now an asymmetry of information between buyers and 
sellers. Nevertheless, good cars and bad cars must still sell at the same price since it is 
impossible for a buyer to tell the difference. Rationally anticipating this fact, buyers will 
not accept a high price so that owners of good cars are trapped, they cannot sell their 
car for their real value nor for the price of a new car. Hence they remove their cars from 
the market so that actual sales are mostly lemons and since some potentials sellers have 
withdrawn, the total number of transactions tend to be low. 

Common Values 

We will show in a simple setting how an extreme market failure may occur: if sellers 
and buyers value cars identically then the market completely breaks down. To see this, 
consider a number of used cars whose quality or value 9 varies in the interval [d;d]. 
If the price is 0, then all owners want to sell so that S{9) = 1, the total mass of cars 
while if the price is §_, none of them find it profitable to sell, hence S{9) = 0. By the 
same token, demand is maximum for p = 6 and nil for p = 6. If, demand and supply are 
price responsive in the intuitive way then the two curves must intersect at some price 
P\ £ ]d;6[ for some quantity q\ as shown by the plain lines on Figure 21.1. We shall see 
that this price cannot be an equilibrium price at which trade can take place. 

When all owners offer their car for sale at price 6, the average quality of cars for 
sale is 0\ = E[0], the average quality of all cars, when the price goes down to p\, the 
supply shrinks to q\ = S[pi) < S{6) = 1 and we know for sure that some of the best cars 
have been pulled out of the market because their owners are the first to be hurt by the 
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Figure 21.1: Market for Lemons 



price decrease. Since potential buyers known that sellers are rational, they understand 
what's going on and therefore revise their estimate of the average quality of cars for 
sale down to 2 = E[0 \0 < 0\] < 8\. As the demand for cars depends also on their quality 
the effective demand curve will drop to reflect this updating. It is now clear that the 
price p\ generates an excess supply since D{6 2 ,pi) < D{0\,pi) = q\ = S{p\); it cannot be 
an equilibrium price. A further price drop down to p 2 will not solve the problem either. 
Indeed, it generates a lower supply, thus another downward revision of expected quality 
to some 63 = E [q 1 6 < 2 ] < 02 by buyers so that supply S(p2) will still exceed the updated 
demand D{Q- i ,p 2 ). 

This spiral of descending prices ends at 6_ where cars of exactly this value (the worst 
of the market) are sold for that price. If instead of cars we consider the metallic content 
of coins we obtain the law on currencies. Likewise, Groucho Marx considering himself 
a man of the street realizes that his joining a posh club would lower the average status 
of the club to the point where it is not worthwhile anymore to join. As regards general 
business, the adverse selection problem just mentioned can be circumvented by certifi- 
cation or guarantees. Private intermediaries like Veritas or TUV Rheinland and public 
ones likes the national or international certifications authorities certify conformity with 
security, health rules or ISO standards of products in order to remove part of the un- 
certainty regarding their quality (cf §24). 2@ Likewise, guarantees assure buyers that 
they will not end up with a "lemon". Since fulfilling the guarantee is costly for the seller 
when he makes products of poor quality, offering a guarantee is also a signal of quality 
(cf. §11.3) for consumers and competitors. For instance, car dealers often propose used 
cars with a 6 month guarantee (at a higher price) after revising them extensively. 



Differing Values 

Although shocking, the "lemons" example does not display a true market failure (there is 
no efficiency loss) since preferences were assumed identical between buyers and sellers 
so that ownership did not matter. Assume now that buyers value a car of quality at 
(1 + A)0 and are more numerous than sellers so that the latter capture all gains from 
trade. The market is in equilibrium at the price p* if the expected value of car sold for 
that price is exactly that price i.e., if (1+A)E[0|0 < p*] = p*. Using the uniform distribution 
over [0;0], we obtain p* = y^f 0. The equilibrium is said to be partially pooling because all 
owners of type < p* act in the same way, they sell at price p* . Likewise, all owners of 
type > p* act identically by keeping their cars out of the market. Although the market 
is active, its outcome is inefficient as buyers value cars more than sellers; in a world of 

_ f)—f) 

perfect information, all cars would be sold. This will occur here if p* = o A = t-=, that 

C7+D 

is to say if the differential in intrinsic value between buyers and sellers is large enough 
to overcome the information asymmetry. 

21.1.2 Corporation vs Partnership 

In service activities requiring a large human capital such as lawyers, doctors, architects 
or auditors, firms are rather organized as partnership instead of corporations. Levin and 
Tadelis (2005) explain this choice as a consequence of the large information asymmetry 
present in these services, namely that the market (potential clients) observes imperfectly 
the quality. 

Set-up 

In the activity under study, people's intrinsic quality is drawn randomly in [0;1]. The 
market wage for that activity is an average w e]0; 1[ of quality over people active in the 
entire economy. Due to state regulation, workers must study to earn degrees and pass 
many exams that enable a potential employer to screen them and thus hire only the most 
talented people. If the ability of the last person hired is q then the firm size is 1 - q and 
its average quality is E [0 10 > q] = A^-. Setting up a firm involves a fixed cost F. 

A priori, clients pay for the average quality of the firm but since it is imperfectly 
observed, there is a probability A that they pay for an average expect quality q e . The 
expected price is thus 

p = {l-\W[d\6>q\+\q e = {l-X) l -^- + \q e (21.1) 

Parameter A measures the extend of information asymmetry between sellers and buy- 



ers. By choosing its size (how many employees), a firm is choosing an average quality 
which affects the price more or less according to the severity of information asymmetries. 
We can now study the difference between corporations and partnerships. 

Corporation 

A corporation (A) maximizes the economic profit where employees cost is evaluated at 
the market wage, thus n - (1 - q){p - w ) - F. The FOC for optimal size is 

= — = — - [p - w) = w- ((1 - X)q + Xq e ) (21.2) 

i.e., the expected ability of newest employee as assessed by the market is the market 
wage. As || is decreasing with q, there is a unique solution to (21.2). In equilibrium, the 
market expectation for average quality is correct, thus q e = A^- so that 

w = (l-X)q + X^-oq A = ^r<w (21.3) 

since w < 1 but with equality for A = 0. Using the fact that in equilibrium p-w= (1 ~ ) 2 (1 ~ c?) , 
the equilibrium profit reads 

n A = 2 -^0^-F (21.4) 

Partnership 

A partnership (B) maximizes its per-capita economic profit u = y§- = p- w - -^—. Since 
J* oc {l-q)j£ + ji, the FOC for optimal size is || = f^- = -u < 0. Since || is decreasing, the 
marginal ability of a partnership is larger than that of a corporation (solution of (21.2)); 
hence a partnership always employs less people than a corporation. Using (21.2) and 
taking out the market wage on both sides, the FOC for an optimal partnership reads 
(1 - X)0 + Xq e = p - T3- i.e., the expected ability of the newest employee as assessed by the 
market is equal to the average gross profit share. In equilibrium, the market expectation 
is correct, thus q e = -4p = p (from (21.1)) so that 



a -X ) q + X^L = ^-^-o^^ = T l-oq B = l-J^ (21.5) 



2 2 \-q 2 \-q ^> a V 1-A 

The partnership profit then simplifies to 



its = a-w)^^-^F (21.6) 

Notice that our claim qs > qA is strictly equivalent to jta > 0. Inspection of (21.4) 



reveals that if c or A is too large, then the corporation is not sustainable and neither is a 
partnership, a result reminiscent of the lemons problem. 

To compare the two organizational structures notice that for A = 0, the corporation 
is efficient because it maximizes the true economic value of the firm i.e., qA - w. The 
resulting profit no = ^—^ — F is thus the overall maximum and we may conclude that 
corporation dominates partnership. Now, for the cleverly chosen A = j^~, we have -^- = 
2{F+7t ) = (1 - w) 2 , hence qs - w by (21.5). This time, it is the partnership that maximizes 
the true economic value of the firm and is therefore the most efficient organization. By 
the implicit function theorem, there must exists a threshold X\ £ ]0;Ao[ such that tta - 

_ 3@ 
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We conclude that the optimal organizational form is 

Corporation for low information asymmetries (A < A^. 
Partnership for intermediate information asymmetries (X\ < A < A ). 
Neither for large information asymmetries (A > A ). 



21.1.3 Signaling 

As in the previous analysis, when an economically relevant personal characteristic is 
unknown to the market or other trade counterparts, he may want to act so as to signal his 
peculiarity. The concept is introduced by Spence (1973) in the context of education where 
a worker decides to acquire a costly education to signal his innate high productivity to 
potential employers. 4@ In the same vein, advertising , on top of being informative (cf 
§11.5.2), can also be a signal of high quality. The idea applies also to a central bank 
whose monetary policy today is a signal of its willingness to accept inflation in the future. 
Employers then act according to the belief they form about future inflation. 

The difficulty with signaling is that the valuable agent faces the threat that a cheap 
imitator might mimic his attitude and ruin his reputation- and the wage that goes with 
it-when results reveal the true characteristics. The only way to reveal credibly his true 
identity is to undertake something so costly that imitators would not dare follow the 
same path. Yet this commitment or bonding being costly it must be wisely chosen. 

Education 

It is agreed that at least in the hard sciences, education adds cognitive skills to students. 
Their market productivity is thus increased and so is the market value of their labor. 
But education is also a process of socialization where one acquires skills such as the 



carrying out of assigned tasks, getting along with others, regularity, punctuality. Lastly, 
and this the point we pursue here, education is a filter in a world of imperfect informa- 
tion. Employers have a very poor idea of a candidate's productivity although they may 
know the distribution. By delivering verifiable certificates, schools sort out candidates in 
small groups whose productivity is easier to assess. The education system thus reveals 
information about those who go through but also about those who abstain. 

Consider a population of workers of either high or low ability in proportion a and 
1 - a. Their respective productivities are Oh and a lower 0/. If abilities were observable 
then competition between firms (zero profit condition) would drive differentiated wages 
to Wh = Ou and w\ = Q\. More plausibly, ability is unobservable. Each worker can then go 
to university for a duration of q months in order to signal itself to future employers. The 
disutility of schooling (sic) is c{q,0) = | i.e., both cost and marginal cost are lower for the 
high ability worker. Notice that education does not increase productivity, it only serves 
a signal of ability. Outside opportunities are normalized to zero. 

Firms compete for workers by offering a wage scheme w{q). In the absence of the 
educational system the market wage is equal to the average productivity E[0] = aOh + 
(1 - a) 0i. When a worker shows up with a level of education q, a firm's belief that he is of 
high ability is a q . Its rational wage offer is thus w[q) = a q 0h + (l - cc q )0i- 

In a pooling equilibrium all workers choose the same level of education q thus the 
belief after seeing q remains a so that the equilibrium wage is E[0]. The belief after seeing 
q ^ q can be a q = (if something that should not appear still appears then employers 
being prudent think that the worker is of the worst type) so that the optimal wage offer 
is Q\. Given this behavior, the best alternative to q for workers is to maximize 0/ - 1 i.e., 
choose no education whatever the innate ability. The proposed equilibrium choice q is 
thus optimal if E[0] - | > 0/ o q < a0{0h - 00 for both Oh and 0/. The condition is thus 
q < aOhWh - Oi) in order that the least able workers accept to go to university. Obviously 
the efficient equilibrium is q = as it saves workers the cost of going to university. 

In the separating equilibrium different types of workers chooses different level of 
education q h and qi, thus they are identified and paid at their productivity i.e., w {qh) = Oh 
and w{qi) = 0;. Beliefs following qh and qi are degenerate and for a different q we may 
set a q = 0. As for optimal schooling it is clear that whatever belief held by employers a 
low ability type cannot get a wage lesser than Q\ thus he shall either pretend to be a high 
type by choosing qh to get w = Oh or never go to school to avoid the disutility. Hence qi = 
and he will not masquerade if 0^ - f^ < 0/ o qu ^ qo = (0/j-0/)0/. Clearly a high ability 
type that spends more than q in education cannot be mistaken for a low type. Hence we 
get a class of equilibria where qh > qo and q\ = 0. It is now clear that other beliefs agree 
with this outcome; for instance a q = if q < qh and a q = 1 if q > qh- Agents utilities in the 
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efficient equilibrium where qn = c/o are u\ = 8i<u\ l = 8\ l - e ■ 

There exists equilibria where one type chooses one action while the other randomize 
between imitation and separation. For instance the high type chooses q^ while the low 
type imitates with probability A and separates with q = otherwise. The equilibrium 
belief is thus a x = a+[ i_ a]X an d the associated wage w {qh) = a x 0h + (1 - a x )6i. The low type 
plays a mixed strategy only if he is indifferent between qh and i.e., w(qh) - |* = 0/ o 

syfQ f) "1/3 

q h = q x = ^rkz^jx - tfo showing that the efficient separating equilibrium is the limit of 
the class of mixed equilibrium as A approaches zero. The off- equilibrium beliefs can be 
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q = Oifq<q x anda q = a+[ "_ a)X if q > q%. 

Advertising 

Kihlstrom and Riordan (1984) show that advertising can be a signal of quality in an 
oligopoly setting if a high quality producer is patient enough. 

The problem faced by a high quality producer is that in the absence of a cheap and 
reliable certification agency it is not able to inform correctly potential customers about 
the quality of its products. For such experience goods , the opinion of consumers is either 
confirmed or revised (upward or downward) according to what they bought and what 
they anticipated. The following "hit-and-run" strategy immediately comes to mind to 
take advantage of this delay in the revelation of true quality: package a cheap and low 
quality product as if it was top-notch, sell it at a high price for one period and make a 
large profit. Then, one lowers the price since the word has spread regarding what was 
the real quality inside the gleaming package. 

Let Uij denote the profit during one period made by selling quality i when consumer 
think it is quality j. We obviously have the ranking nih> iihh since cost are lower for a 
lower true quality but also n Ul >nu since consumers are ready to pay more for (what they 
think is) a higher quality. The present value of aggregated profits over many periods for 
the "hit-and-run" strategy is nih + \~kii where r is the interest rate 5@ and is obviously 
greater than the profit itu + \nu gained by a truthful maker of a low quality product. 

To force out the deceptive strategy of "hit-and-run", a high quality producer can 
simply spend an amount of advertising A larger than the difference n^ - nu because 
it will nullify the benefit of mimicking by a low quality producer. Yet the high qual- 
ity producer should take care of not spending too much in advertising because he can 
still change side and switch to the low quality product. 6 ® The latter condition reads 
iihh - A + jTthh 2: nu + jjiii o A < {Tthh - nil) ^y-- A first necessary condition for advertising 
to be a signal of quality is Tihh > nil i-e-, high quality products must generate more profits 
than low quality ones in a perfectly informed market. As shown in §11.3, the condition 
is likely to be satisfied. 



Comparing the lower and upper-bound for A we derive a second necessary condition 
for advertising to be a signal of quality: r < l hh ~H 11 i-e-, the producer is patient and cares 
mostly for future profits. The numerator is the truthful profit difference between the 
qualities while the denominator is the cost difference between the qualities. 



21.2 Screening & Self-Selection 

The problematic of information acquisition is referred to as screening. For instance, 
§4.3.3 shows a monopoly (or a firm with market power) trying to extract consumer sur- 
plus from its clients (agents). The valuable private information of agents is their willing- 
ness to pay for the good sold by the principal. When direct discrimination is unavailable, 
the firm resorts to indirect discrimination and designs a menu of contracts that is offered 
to all customers and out of which each type of client picks his preferred option, a pro- 
cedure called "Self-Selection". This way, the monopolist succeeds to obtain the private 
information although not completely since more interesting types (high WTP) ends up 
with an information rent. 

This section presents cases of procurement, monopolistic and regulatory screening. 7 ® 
Models are presented in order of difficulty so that each builds heavily on the previous 
one. In the first case, a principal wants to procure a service from an agent at the lowest 
possible cost and tries to screen candidates. In the second situation, known as non linear 
pricing, a firm holding market power discriminate among her clients with a variety of 
contracts to maximizes profit. The third model studies a regulator trying to elicit cost 
information from the regulated firm in order to apply an efficient pricing scheme and 
maximize welfare. We deal with information relative to demand and cost. The fourth 
model adds a moral hazard dimension to the basic procurement case; the scheme must 
now modulate the effort incentives on top of screening innate productivities. Lastly, we 
present the original insurance problem of "adverse selection". When insurers compete 
perfectly, it is possible to pick up the best clients (aka cream skimming) with a well 
designed proposal. This may force firms left with risky clients to go out of business so 
that a market failure occurs. 

21.2.1 Procurement 

Consider a firm procuring refuse collection to a city or a specialized firm procuring IT 
services to a manufacturer. According to whether the technology is cutting-edge or ob- 
solete, a quite different outcome is called for. It would be useful for the principal 8 ® to 
know the agent's cost in order to be able to fine tune the volume and price of the service. 



Yet, this information is often private knowledge to the agent; the principal thus faces a 
problem of "information revelation" and must "screen" the agent to uncover it. 

Symmetric Information The principal's valuation for output q is V{q) while her WTP 
for an additional unit is P{q) = V m {q). The agent's marginal cost 6 is either low (0/ = c) or 
high Wh = c + 8). This parameter which synthesizes the private information problem here 
is referred to as the type of the agent; in our simple polar example we have the good and 
bad types or the cutting-edge and obsolete technologies. 

The objectives are n = V{q) - t for the principal and u = t-Oq for the agent; their 
sum is the welfare W{d,q) = V{q) -6q. Money is thus simply a means to transfer utility 
among parties. Ex-ante, before the agent's type is determined, we deal with the expected 
welfare 

E[W] = a [V{qi) - cq t ] + (1 - a) [V(q h ) - [c + 8)q h ] (21.7) 

Pareto optimality calls for its maximization and can be achieved type-by-type with 
output q* = P~ l {6), the solution of V m = 0, for 6 = l,h. 

If the principal knows the agent's type 6, he can offer him a contract {q, t) with t > Oq 
in order to meet his participation constraint u > 0. Since the principal dislikes payments, 
it is optimal for her to saturate the previous constraint with t = 9q. Profit is now n = W 
and output remains the sole element to be chosen; its optimal level is the efficient one q* . 
This so-called "first-best" or ideal scheme is discriminatory since different contracts are 
proposed to different types of agents. Note the similarity with the basic moral hazard 
case of §20.1. 

Asymmetric Information When the principal ignores the firm's type, she can offer 
an "attractive" contract guaranteeing full participation i.e., satisfying the participation 
constraint of any type. Because types are ordered, the condition {V0, t>0q} boils down 
to t > Ouq i.e., the problem is bring the bad type on board. As before, it is optimal to 
saturate the latter so that the principal's objective becomes n = V{q)-8hq. This is as if 
the agent was, for sure, of the worst type, in which case the optimal output is q* h . This 
crude solution completely forsakes the fact that, on average, the agent is much more 
efficient and that welfare (or principal's profit) could be vastly improved. 9@ 

The adequate way to manage the diversity of possible technologies (types) is to offer a 
menu of contracts and let each type of agent picks his most preferred one. For instance, 
the principal could offer simultaneously the ideal contracts y* x = (q* v t* t ) and [q* h , tT), hop- 
ing that each type of agent will pick the one intended for him. The bad type will stick to 
Y* h (over y*) since by construction we have = t* h -dhq* h > tf -&hq\ - -qfWh-Qi)', this is 
because y* requires a heavy work load paid at less than the actual cost. Sadly, the good 



type will also prefer 7* as = t* -Q\q\ < tl~diq* h = q* h id h -di); this time, the workload is 
light but paid handsomely because from the point of view of the good type, he stands to 
make a profit equal to the marginal cost difference on every unit of output. 

The question then is whether we can we design a complex screening scheme that 
would lead to discriminating behavior so as to uncover the agent's type and have him 
perform in a relatively efficient manner ? The answer is twofold. Yes, we can discrim- 
inate (or screen) using only as many contracts as there are types provided they satisfy 
some incentive constraints. Yet, in the process, the principal has to give a rent to the 
good type and accept that the bad type under performs wrt. the ideal perfect informa- 
tion situation. 

Revelation Principle 

We tackle here the issue of complexity with the revelation principle. In order to motivate 
the agent to reveal his type (private information), the principal can offer a menu of 
contracts; for instance a red, a green, a yellow and a blue contracts. If type / picks the 
green one while type h picks the red one then it must be the case that type / prefers the 
green over the red while type h ranks them inversely. It is then obvious that the blue and 
yellow contracts were unnecessary in the first place. It is even useless to name contracts 
with colors; we can directly name them "low cost" and "high cost" (or use labels I and 
h). As shown by Myerson (1979), this intuition carries on to much more mathematically 
advanced models of asymmetric information. 

In our simple two types settings, the revelation principle allows us to restrict at- 
tention to pairs of contracts that are self- selecting in the sense that each type of agent 
prefers the contract designated for him over any other. The conditions that can be de- 
duced from these observations are called the incentive compatibility constraints (IC). 
Lastly, we must not forget the participation constraint i.e., the contract designed for a 
specific type of agent must leave him with a non negative profit for otherwise he would 
decline it. This yield a set of individual rationality constraints (IR). 

Second Best Solution 

Given a pair of contracts {qh,th) and (qi,t{), the principal's objective is to maximize ex- 
pected profit 

E[II] = a [V{qO - ti] + (1 - a) [V(q h ) - t h ] (21.8) 



under constraints 

f ti-6iqi>t h -6iq h © . f ti>6iqi © 

[IC)< and (IR)\ ^ (21.9) 

I t h -6 h q h >ti-d h qi © { t h >9 h q h © 

The (IC) constraints simplify into Biiqi-qu) <U-t h < 6 h {qi-q h ) => < {9 h -di){qi-q h ) => 
qi > qn since du >Qi- We also derive ti> t%. Observe now that © and © imply © since 
ti - Oiqi > th -Q\qh > th - dhqh ^ 0. Neglecting © for the moment, 10@ the optimum satisfies 
th = dhqh = {c + 5)qh (no rent for the high cost agent) and t\ = th + Oiiqi - qh) = cqi + 8qu 
(minimum rent for the low cost agent). We may then rewrite the expected profit as 

E[n] = a [V{qi) - cqi] + (1 - a) [V(q h ) - (c + 8)q h \ - a6q h (21.10) 

We see that wrt. the welfare objective of (21.7), an added cost appears; it is the in- 
formation rent left to the efficient type to motivate him to reveal his private information 
(here, his ability to perform at low cost). The optimum in the incomplete information 
setting is called "second best" because the informational rent generates an allocative 
distortion as we now demonstrate. 

The FOCs for maximizing (21.10) are Pi(qi) - c and Ph - c + yf- > c + 8. 

The first result is called "no distortion at the top" because the low cost firm (good 
type) is ordered to produce the efficient quantity. The high cost firm (bad type), on the 
other hand, is instructed to reduce output (and perform inefficiently) in order to lower 
the rent left to the good type. This distortion is meant to hurt a good type pretending to 
be a bad type, while not hurting the bad type as much. Thus, the principal is trading-off 
the efficient behavior of the low type with the information rent of the high type. Lastly, 
we check that ©, the incentive constraint for the low type, is satisfied at the candidate 
optimum as it reads 0> ti-6hqi<> qi> qh- 

21.2.2 Non Linear Pricing 

Non linear pricing by a monopolists involves selling to consumers of unknown type while 
procurement involves purchasing goods or services from providers of unknown type. The 
optimization procedures are thus dual one of another. 

Indirect Price Discrimination 

A firm with marginal cost c sells a good or service under a package {q, t) where q is 
consumption and t total price (fee). If potential clients form an homogeneous population 
sharing WTP P{.), the optimal discrimination scheme is to offer the efficient quantity q* 



solving P{q) = c for total fee t* = V{q*) where V{q) = f^ P{x)dx is the gross utility enjoyed 
from consuming q units (cf §4.1.3). 

Consider now the differentiated "home" and "pro" market segments whereby the lat- 
ter agree to pay premium 8 over the former i.e., we have WTPs P/(.) = P{.) and Phi-) = 
P{.) + 8 (and gross utility functions V\ and V^). 11@ Notice that contrary to the previous 
section, the "good" type is h. We may interpret the premium 8 > as a measure of the 
information asymmetry affecting the monopolist who cannot distinguish among the two 
types. 

By the revelation principle, the monopoly need only design two contracts (qu, h) and 
iqi,ti) satisfying self-selection. Letting ut = Vi{q{] - ti for i = l,h, the per-type profit is 
jit = ti - cqi = Viiqi) - cqi - ui (note that 7T; + U\ is welfare). The individual rationality 
constraints (IR) are u\ > while the incentive compatibility constraints (IC) are u\ > 
Viiqh) -t h = u h + 8qi and u h > V h {qi) -ti = ui~ 5q h which simplify into 

8qi<u h -ui<8q h (21.11) 

from which we deduce qt > qi (in a manner quite similar to (21.9)). 12@ The information 
rent of the "pro" is again the lower bound 8qi in (21.11); it is the net surplus he can secure 
by grabbing the proposal designed for a "homer" i.e., by pretending to be of a different 
type. 

As in the procurement case, we come to the conclusion that the firm leaves no rent to 
a "homer" (i.e., sets ui = 0) and leaves the minimum information rent to a "pro" (i.e., sets 
Uh = 8qi). The expected profit is thus very much like (21.10) with 

E[I1] = an h + (1 - a)%i = a [V h {q h ) - cq h ] + (1 - a) [ViiqO - cqi] - a6qi (21.12) 

The FOCs are c = Ph = P{.) + 8, leading to choose the efficient quantity qh = q* h , and 
Pi = P{.) = c + -^ leading to choose a reduced quantity qi < q*. 13@ The ratio -^ may be 
interpreted as the marginal cost of information acquisition. 

Observe that if a and 8 are large then P/(0) < fz^ might become true. In that case, it is 
better to exclude "homers" by setting qi = 0. The monopolist does this to avoid paying the 
information rent to "pros"; recall indeed that if the proposal is unappealing to "homers", 
the firm can extract all the consumer surplus from "pros" because she faces a unique 
segment of homogeneous people. 



Generalization 

In the most simple model, the average WTP for the good is P{q) so that average TWTP is 
U{q) = Jq P{x)dx. The heterogeneity among customers is captured by a random variable 
6 of law H and unitary mean with the following meaning: a consumer of type 9 has WTP 
6P{.). Taking into account the marginal cost of service c, the gross welfare associated to 
type 6 is w{9,q) = 0U(q) - cq. Net welfare is W[6) = w{d,qg) where the efficient quantity 
qt solves 0P{q) = c. If the monopolist could perfectly discriminate, he would sell type 
the amount q* against total payment t* = 6U{q*) or price at marginal cost together with 
a subscription f* -tl- cq* Q . 

When discrimination is impractical, the monopolist sets a variable unit price p{q) or 
equivalently a tariff T{q) = f^ p{x)dx. When faced with this tariff, a consumer of type 
derives a net surplus or utility u (6, q) = 0U{q) - T{q), thus chooses or demands the optimal 
level q(9) satisfying the FOC 14@ 

eu'{q) = T'{q)o6P{q) = p{q) (21.13) 

Since the WTP is decreasing, a larger 6 moves the LHS upwards and thus forces an 
intersection with the p{.) curve further on the right (whatever its shape may be); this 
means that q{9) is increasing. 15 ® 

If the firm wants type to consume some particular level q, it is enough to set p{q) = 
6P{q). This shows that the firm is able to induce any consumption path across types 
by tuning her tariff adequately. Hence, the search for an optimal tariff T(q) is a quest 
for an optimal path of consumption q{0). We already know that the firm cannot fully 
extract the consumer surplus and must leave each type a rent above his opportunity 
cost. Indeed, the indirect utility or equilibrium surplus is v{6) = u{6,q{6)) and satisfies 
v ' = U by the envelope theorem. Integrating, we obtain a cumulative formula with v{Q) = 
v(9) + fSu(q(x))dx. 

Setting w.l.o.g. the opportunity cost at zero, the participation constraint is u{9, q{0)) > 
0; it must hold for all types but it is in fact enough to make sure it holds for the lowest 
one since u[6,q{6)) > [d-§)U{q{6)) > i.e., higher types can always use the scheme 
designed for the lowest type and get more than their opportunity cost. Now, it is obvious 
that the firm will adjust her scheme so as to leave no unnecessary rent to the lowest type 
i.e., v{6) = holds at the optimum. Ex-ante, the rent paid to the entire population is 

E[v{0)] = \ v{6)h{6)d6= j J h(6)U(q(x))dxd6 = 1(1 -H{6))U[q{6))d6 



using integration by part and assuming the distribution H has a density h. Finally 

E [ v{6)] = j 1 ~ h T? ) U{q(e)) dH{6) = E [p (0, q{6))} (21.14) 

where p{6,q) = 1 "^j U(q). The usefulness of this expression is to disentangle the exoge- 
nous type parameter from the endogenous quantity choice of the consumer. 

By construction, profit is welfare minus utility i.e., T{q) -cq= w(0, q) - u{6, q). Taking 
into account the fact that each type of consumer demands a particular amount q{9) of the 
good, the firm earns n[6) = 8U[q{0)) - cq{6) - v{9) per-capita. Using (21.14), the ex-ante 
profit simplifies into n = E[7r(0)] =E[<p[6,q{0))] where cp{0,q) = 0U{q)-cq-p{0,q). It is now 
obvious that the maximization of the ex-ante profit requires an ex-post maximization for 
each distinct type of customer: the optimal consumption q{6) is that maximizing (p{9,q). 
Using (21.13), the FOC is 

[BU'{q) - c) h(0) = (1 - mO)) U'(q) o — = 1 ~ H ® ) (21.15) 

p eh{6) 

The rightward version is a Lerner index (cf eq. (3.4)) showing how the efficiency 
condition is distorted away by the monopolist. Note however the absence of distortion 
(at the top) for the highest type who also consumes most. Conversely, the distortion is 
greatest for the lowest type. The leftward version of (21.15) shows that the monopolist 
willing-fully creates a deadweight-loss to reduce the information rent left to high type 
customers. Indeed, the LHS is the gain from marginally extending sales to type e times 
the population of this particular type while the RHS is the infra-marginal loss of allowing 
a greater rent to all better types. 

For most statistical distributions, the hazard rate yzhM is increasing, so that the 
RHS of (21.15) is decreasing with type i.e., the price solution is decreasing. Now, rewrit- 
ing (21.13) as P(q) = p{q)/e, we see that the RHS decreases with type. Since WTP is 
decreasing, it must be the case that the optimal output is increasing as required to be 
the optimal choice of the consumer. 16@ We may thus conclude: 

21.2.3 Public Firm Regulation 

As explained in §17.2, the ideal (second best) regulation for a public service is a per unit 
price close to marginal cost together with a subscription large enough to cover fixed cost. 
Choosing these two parameters is not an easy task for the regulator because she can 
only use the cost report handed over by the firm's manager who might therefore have an 
incentive to overstate true cost. This possibility is a real issue given his better knowledge 



of the production technology. Combined with the fact that public funds involve a loss of 
efficiency (cf. §17.1.2), the optimal regulatory policy will be distorted as if there was an 
additional marginal cost of eliciting the private information of the firm. 

Another issue affecting regulation is the cyclicality of demand of many public ser- 
vices. The existence of peaks and off-peak periods warrants a flexible price policy from 
an efficiency point of view. For reasons better explained in §25, inter-temporal discrimi- 
nation is often forbidden and leads to excess capacity and large deadweight losses. In a 
few cases such as the pricing of highways operated under franchise, regulators are more 
and more willing to adapt their regulatory framework to account for this reality. Yet they 
face the problem of eliciting the demand from the operator in order to set the price at 
the efficient level. This is what we explore in the first part before delving into the more 
involved case of unknown cost. 

Private Information on Demand 

To assess the variability of demand, we simply assume that the market size parameter 
a in the usual demand formulation Dip) = a- bp is a random variable observed only by 
the regulated firm, thus unknown to the regulator. The firm has to build a capacity of 
service k (at marginal cost 6) and can produce up to k units at marginal cost c. 

Efficient Pricing The efficient pricing rule is found by following a simple procedure. 
Try first to price at marginal cost; if the resulting demand can be met (D{c) < A;) this is it; 
if on the contrary there is potential congestion (D{c) > k), then raise the price until excess 
demand vanishes (Dipk) = k) i.e., in our example, set pk = ^. This is the most efficient 
manner to ration consumers i.e., that which minimizes the welfare loss generated by 
the limited service capacity (cf. §25.3.2 on congestion). The threshold market size such 
that marginal cost pricing does not generate congestion is ak = k + be. We thus obtain 

n-V I C if A < flfc 

p fc (fl) S mm{c,f}= a _ k . 

I ~b~ n a - a k 

The fact that the efficient price is a rule depending on some piece of information un- 
known to the regulator poses a problem for he has to trust the firm to reveal it correctly. 
However this is not going to happen because the firm has an incentive to overstate the 
true demand. Indeed, if the firm claims that the demand parameter is some a greater 
than the true level a, then the regulator will allow the price pkia); it then remains to 
adjust a so as to equate pki&) with the monopoly price. 17@ 

Fortunately, Riordan (1984) displays a simple contract restoring the incentives of the 
firm toward the common good: the firm is allowed to set the price p, has to meet demand 
up to capacity and receives subsidy Skip) = 8k-ip- c)k, contingent on her announced 



price. 18@ The scheme involves a fixed subsidy covering the capacity cost and a variable 
tax component to extract any margin made over sales. The firm's profit is now n{p) = 
[p - c) min \D{p) , k\ - 6 k + s^p) < by the very definition of s^C). Hence, the most that she 
can expect is to earn zero (extraordinary) profit. We now show that pkl) is a rule that 
precisely enables to achieves this maximum. Indeed, the firm earns: 

{{c-c)D-8k + {c + 8-c)k =0 ifa<ajt 
[pk-c)k-8k + {c + 8 - pk)k =0 ifa>ajt 

I We conclude that the optimal regulatory mechanism is akin to a two-part tariff 
whereby both the subsidy and the unit price respond to shifting demand conditions. 

Efficient Capacity The efficient capacity is found by equating the marginal cost 8 of 
capacity expansion to its marginal value which is the wedge between the WTP of the 
first rationed consumer (i.e., the price) and the marginal cost of service c. Since the price 
varies with the intensity of demand, the efficient capacity k* solves E[pfc(.)] = c + 8. 

When the regulator offers the price contingent subsidy Ski-), the firm prices efficiently 
and thus reveals the state of demand. The regulator is thus able to compute the expec- 
tation E[pfc(.)] to ascertain whether the actual capacity is too small or too large. 

Once the firm has been instructed to invest efficiently at the level k* , the subsidy 
mechanism becomes budget balanced; indeed, the expected subsidy is E [Skl)\ = k [c + 8 - E [p^ 
since the firm is motivated to price efficiently. This expression is zero precisely when the 
efficient capacity A;* is build. This is a very convenient property since deficit or surplus 
are always source of costly haggling among stake-holders in the regulation process. 

Private Information on Costs 

When the regulated firm knows better than the regulator its ability to perform the job, 
we have a situation similar to procurement. The only difference is the principal's objec- 
tive: a regulator cares for both consumer surplus and firm profit but is reluctant to tax 
the rest of the economy to finance the firm. The marginal cost of public A funds alluded 
to in §17.1 will introduce a slight modification of the optimal second-best regulatory pol- 
icy. 19 ® We follow Laffont and Tirole (1986) and Baron and Myerson (1982). 

The procurement model for variable output seen in §21.2.1 is easily adapted. The 
regulated firm's profit is n = t-dq where the marginal cost is di = c with probability a 
or Oh = c + 8 with complementary probability. Raw consumer surplus is U{q) (with WTP 
P{.)) while a contract is an output-transfer pair {q, t). The regulator's objective is welfare 



net of the cost of public funds i.e., 

W = U{q) -dq-Xt= U{q) - (1 + X)dq - Xn oc V[q) -Oq-rn (21.16) 

with V{q) E gg and Ts _i_. 

The (IC) and (IR) constraints are identical to (21.9) leading to an information rent 
Tdqh once transfers are minimized. Expected welfare is thus 

E[W] = a [V{qi) - cq x \ + (1 - a) [V(q h ) - (c + 6)q h ] - ar8q h (21.17) 

which is identical to (21.10) except for the multiplier t in the last term. 

The FOC for the low cost firm is V m = c o P{.) = (1 + X)c i.e., she produces the efficient 
quantity (no distortion at the top). The FOC for the high cost firm is P{.) = (l + A)(c+<5) + ^rf 
i.e., equalizes marginal benefit with a virtual marginal cost summing the true marginal 
cost of production and the cost of eliciting the firm's private information. 20@ Although 
the regulator does not care per-se for wealth distribution, he tries to avoid distortionary 
tax collection and is thus eager to minimize the information rent left to the low cost firm. 
Consequently, the dearer firm is instructed to produce less than what would be efficient 
to avoid imitation by a cheap firm (qj, < q*). 

21. 2 A Procurement and Moral Hazard 

In many instances of procurement, the principal (e.g., government, firm) contracts the 
agent (e.g., builder, maker, consultant) to produce a single and idiosyncratic item such as 
a bridge, an industrial design or a firm re-organization. In that case, the principal would 
like the agent to specialize over that task and invest to develop specific skills that would 
reduce the overall cost the project. 

As before, the agent's basic technology can be cutting-edge or obsolete, an informa- 
tion known only to him. The novelty here is the possibility to meliorate the technology 
(reduce cost) by investing into assets or capital specific to the delegated task (cf §14.2). 
Denote /3 the cost saving undertaken be the agent under a regime of residual claimancy 
(cf. §20.1). 21@ This investment, however, is not contractible as it involves personal and 
unobservable skills and actions. This element of moral hazard (cf. §20) generates an 
adverse selection problem. 

Worst Case: CP and FP 

Under a Cost-Pius Contract (CP), the principal repays the agent all his expenses (re- 
lated to the task). The latter seeing that any improvement effort falls on his shoulders 



will undertake none (cf. §20.1.2). The principal thus reimburses the full cost and on 
expectation pays 6 = E[0]. 

Under a fixed price (FP) contract, the principal offers the fixed price t for completion 
of the item. The agent, whatever his type, then becomes residual claimant of any tech- 
nology melioration thus invest optimally and saves p. His payoff is then t-9 + ft. He 
will accept the initial offer only if t > 6 - /3. If the principal insists on getting the item 
no-matter-what, she has to set a high price to attract the high cost firm (bad type) i.e., 
t>0h-/3 (with equality at the optimum). In that case, the low cost firm (good type) enjoys 
a large rent. 

This "no risk FP" contract is better than CP if 6h - P < 6 o P ^ oc8 i.e., when the gain 
from optimized technology is large, so that providing good incentives is important. Con- 
versely, if the information asymmetry is large (high 8) then CP is better to avoid leaving 
such a large rent to the low cost agent. 

Middle Case: a simple option 

A frequently observed contract is a CP with an option to switch to FP. Obviously, if the 
fixed price t is low, no one ever picks the option so we are back to the pure CP case. 
Likewise, if t is large, everyone picks the FP option and we are back to the "no risk FP" 
case. For intermediate t, only the low cost agent picks the FP option and implements the 
maximal cost saving /3. Among these, t = di- ft is optimal as it minimizes the payment. 
The expected cost for the principal is thus a{6i - p) + (1 - a)8h = 6 -aft which clearly dom- 
inates the CP contract. The practical value of this simple option contract is that it can 
be computed without any knowledge of the agent's ability to improve his technology i.e., 
without resorting to a screening mechanism. 

Best Case: no information asymmetry 

To develop fully the model we follow footnote 26.3 and denote e a marginal cost reduction 
and d{e) the financial cost of achieving it. The principal's objective is to minimize the 
expected cost 

E[C] = a [di - ei + d{ei)] + (1 - a) [d h -e h + d{e h )\ (21.18) 

where e, is the level of cost savings implemented by the agent (which depends on the 
contract he agreed to). 

If the principal could observe the agent's type, he could taylor a FP contract for each 
type i = l,h with t* = 0i - /3. This offer would be accepted and followed by implementation 
of the efficient cost savings so that realized cost would be 0i • -/3. The principal would thus 
get the item done for an expected cost of E[C] = 9 - /3 i.e., the maximal improvement over 



the CP base case. Equivalently, the principal offers the agent to bring cost down to 6, ■—$ 
and pays him exactly that amount. 

In the realistic case where the principal does not observe the agent's type, she has to 
devise a revelation mechanism. We already know from the previous models that naively 
offering the efficient FP contracts t* h and t* t fails because both types prefer the greater 
payment t* h i.e., the outcome is the same as with to the "no risk FP" contract. 

Second Best: complex option 

Our basic instrument is the target contract (c, t) whereby the principal agrees to pay the 
agent t + c if the cost - e is brought down to c. When type accepts such an offer, he 
must implement a minimum effort saving e = 6-c to get the payment. 22 ® His profit (rent) 
is then n{6, c,t)-t- d{6 - c) since procurement expenses are paid by the principal. 

The principal offers contracts (c/, t{] and (c/,, tu). We denote ei = 6; - ct for i = l,h and 
m = n{di, Ci, tt). The (IC) and (IR) constraints are then m > n{0i, Cj, tj) and m > for i = l,h 
and j ^i. With target contracts, if the low cost agent picks contract {Ch, h), he will have 
to exert effort d\- c\ x - e^-Q <ey l because he is intrinsically more efficient than the high 
cost agent. On the contrary, if the high cost agent picks (c/, ti), he will be forced to exert 
effort dh~ci = ei + 6> ei since it is more difficult for him to pretend to be a low cost agent. 

The principal's objective is to minimize the expected cost of producing the item 

E[C] = E [a + t t ] = E [dt - ei + d{ei) + m\ (21.19) 

under the (IC) and (IR) constraints 

| ti-d{eO>t h -d{e h -6) (/Q) ^ f t t >d{eO (IRi) (2120) 

[ t h -d(e h )>ti-d(ei + 6) (IC h ) * [ ti>d{eO (IR h ) 

As before, the (IC) conditions simplify to d{e\ + 6) - d{e^) > ti - t^ > d{e{) - d{eh -6) = v 
which is the information rent that the low cost agent receives to reveal his type. Mini- 
mizing rents by setting 7th = and ii\ = v, the objective becomes 

E[C] = a [6 1 - ei + d{efi] + (1 - a) [8 h -e h + d(e h )] + a [d(e h ) - d(e h - 6)] (21.21) 

which is the complete information formula (21.18) plus the cost of information acquisi- 
tion, the informational rent paid to the low cost firm (good type). The FOCs are 

d'(ei) = l and d'(e h ) = 1- -^—(d'[e h ) -d'(e h -6)) (21.22) 

l-a 



leading to optimal choices e/ = e* and et < e* i.e., no distortion at the top and an optimal 
reduction of effort at the bottom to reduce the information rent. Lastly, we need to 
check that (IC^) is satisfied at the candidate optimum. The condition reads n h = > 
ti-d{ei + 6) o d{eh)-d{eh-8) < d{ei + 8)-d(ei) which is true because we assumed d convex 
and we have e/, < e; at the optimum. 

This complex scheme improves over the simple CP-FP option but not so much as we 
now proceed to show. 

Extension to many types f 

To compare the second-best optimal self-selecting menu of contracts with the simple 
CP-FP option, Rogerson (2003) assumes that the agent's type is uniformly distributed 

2 

between Q\ and 6h- We also take d{e) = |g in order that e* - 2/5 and that ft is indeed the 
maximum of e - d{e). 

A CP-FP option at t leads all types below 6 = t + p to pick the FP option and all the 
types above to stick to CP The contract is thus parametrized by t or 0. The expected cost 
is then C(0) = fS [0 - /3) dF[x) + f Bh xdF(x) and its derivative is proportional to f(0)[0 -p- 

6) + F{6). With the uniform distribution, the equation becomes — j 1 = % and the optimal 
cutoff level is thus 6 = mm{6h,6i + ft} = 0i + min{<5,)6}. The optimal optional FP is then t* = 
6 1 + min{0, 5 - j6} . The expected cost is C = 6 - min j /3 - |, ^ \. The percentage improvement 
of the simple option over the basic CP in terms of the maximum cost saving k is then 
p^ = min{l-!,§}. 

Under the optimal second-best contract, the principal offers a menu (c(6),t(6)) for 
de[9i,d h ]. We let e(d) = d-c(6) and n(9) = t(d) -d(e(6)). The IC condition is n(d) = -d(e(.6)) 
while the expected cost is E[6 - e(d) + d(e(6)) + n(d)]. The FOC for 6 is then d(6) = 1 - 
|H d{6). In our uniform distribution example, it reads A = 1 — ^ leading to e{6) - 

minlO^^ + e/ -6}. Finally Rogerson (2003) shows that p* = minjl - ^ + j|^,§§}. 

The ratio of p to p* is always greater than | and equal to this value for 8 > 2p\ Hence, 
a simple CP contract with a fixed price option computed without knowledge of the agent's 
preferences is within 75% of the second-best efficiency frontier. 

Accounting Regulation 

Laffont and Tirole (1986) studies the previous regulation problem with the slight im- 
provement that realized cost are retrievable from the accounting statements. This al- 
lows her to instruct the firm to produce efficiently, conditional on the realized cost. The 
regulator's problem now is that she is unable to disentangle the role of effort e from the 
underlying marginal cost 9 in the final unit cost c = 6 - e. A low observed cost occurs 



only when luck and hard work are combined together while a high cost is a sure indica- 
tion of low effort (and back luck). The identification problem lies with an intermediate 
observation that can be due to luck if 6 = Q\ or hard work if 6 = Q^. 

The adverse selection formulation follows §21.2.3 and adds a moral hazard issue as 
in §21.2.4. Welfare is thus a combination of (21.16) and (21.19) with W = V{q) -{9- e)q- 
d(e) -Xt. Since profit is n = t - {6 - e)q - d{e), the expected welfare 

E[W] = a [V(qi)-a + X)[(0i-ei)qi + d(ei))-X7ti] (2123) 

+ {I- a) [V{q h )-{l + X)[{9 h -e h )q h + d{e h ))-An h ] 

is to be maximized under the (IC) and (IR) constraints (21.20). As in the previous model, 
the rents are minimized so that (21.23) becomes 

E[W] = a [V(q l )-a + X){(6 l -ei)q l + d(e l ))-A(d(e h )-d(e h -6))} 

(21.24) 
+ (I -a) [V(q h )-(l + A){(6 h -e h )q h + d(e h ))\ 

Observe, as claimed initially, that the FOC for quantity is P = (1 + A) [6 - e) i.e., produc- 
tion is efficient conditional on the achieved level of marginal cost. As for moral hazard, 
the effort of the cheap firm is the efficient level e\ since it solves d'{eO = q\ but the dear 
firm's FOC is d'ien) = qh~ d+Axi-g) [d'(eh) - d'(eh-S)) so that §h < e* h . There is an indirect 
effect on production i.e., q^ < q* h , since a lesser effort raises the marginal cost while for 
the low cost firm, we have e\ = e* and thus qi = q* . 

Continuous types f To simplify we consider the procurement case where just one 
unit of the good is needed (q = 1). The random variable follows the distribution H. 
Since profit is n = t- {6 - e) - d{e), the first best effort e* solves d' - 1. The regulator can 
either mandate effort e* or requires to bring cost down to c* = 6 - e* , the transfer is then 
adjusted to cover all costs i.e., t = - e* - d{e*). 

Under incomplete information, although c is observed, 6 is not anymore so that the 
efficient effort cannot be verified. The regulator can propose the firm a payment against 
a cost level. By the revelation principle, he will offer only schemes (£(.), c(.)) inducing 
truthful revelation of types. Profit upon revealing but being type is <&{6,6) = t{6) - 
c{6) - d{d - c{6)). The incentive compatibility constraint is || = so that n{9) = ®{9,9) 

satisfies it - -d{e{6)) where e{6) = 6-c{6). We may write n{6) = n{6) + f g d{e{x))dH{x) and 
since there is no need to leave excessive rent to the firm, we have n{9) = 0. 

As the value of the procured object is fixed, the regulator objective is to minimize the 
expected cost E[(l + X) {0 - e + d{e)) + Xti] over IC schemes. If he increases the effort of type 
6, whose density is h{6), he reduces cost by (1 + X){d - 1) but the IC constraint forces him 



to increase the rent left to all better types by d{e) and there are H{9) of these. The level of 
effort for type 9 is optimal when h(9)(l+A)(d-l) = H(9)Ad\e) od = l- J) ^^ Observe 
that only the most efficient firm picks the efficient effort (no distortion at the top) since 
H{.) = at the bottom of the distribution. For worse firms, effort is sub-optimal but their 
information rent is lesser. 

Variable Quantity f Assume total cost is C{9, q,e) = {9 - e)q + d{e) + F where 9 is the 
base marginal cost, F the fixed cost, e is the effort or investment of the management 
into cost reductions and d the opportunity cost of effort or disutility (assumed increasing 
convex). The profit is n = qP{q) - C(9,q,e) + t and the optimal effort for the manager is 
e{q) solving |^ = d! - q = 0. Total welfare is 

W(9, q, e) = V(q) - C(9, q, e)-At = V(q) - (1 + A)C(9, q, e) + AqP(q) - AU 

so that the efficient effort is also e{q); hence it is efficient to leave the firm elicit effort. 
Observe now that 

W [9, q, e(q)) = V(q) - (1 + A) [(9 - e(q))q + F + d(e(q))] + XqP(q) - AU 
= W D (q)-(l + A)9q-AU 

where Woiq) = V{q) + (1 + A) [e{q)q- d{e{q)) -F]+ AqP{q) is the so-called "social valuation" 
of the production that the regulator can compute provided he knows d. (cf 2.1 of Laffont 
and Tirole (1993) for public goods). The efficient production, called the Ramsey solution, 
solves (1 + A)9 = W' D = (1 + A) [P(q) + e(q)) + AqP'(q) 

A , p-c A 1 
o P + qP =9- e{q) = coL= = 

where L is the Lerner index (cf. eq. (3.4)) and e is the price elasticity of the demand. If 
there is no tax distortion (A = 0), we fall back on marginal pricing. 

21.2.5 Insurance Cream Skimming f 

We present here a simplified version of Rothschild and Stiglitz (1976)' analysis of insur- 
ance markets. Consider insurance against economic losses c due to an unexpected event 
like a fire that forces a firm to stop production or an accident stopping an individual from 
working. The underlying risk for each client that seeks insurance is different, thus the 
insurer will try to gather a maximum amount of information to assess each riskiness. Yet 
some differences among clients remain unobservable. There are safe customers whose 



accident probability is Oi and unsafe ones whose accident probability is a greater dh- A 
policy consists of a premium p and a reimbursement d in case of damages. The expected 
utility of a type 8 customer is 

U(q, c) = 6u[d-c-p) + a-0) u(-p) (21.25) 

where u is increasing concave since customers are all risk averse (otherwise they wouldn't 
seek insurance). The per capita profit of a risk neutral insurer is n = p - 9d. 

Pareto efficiency commands to eliminate risk by setting d = c. This efficient risk 
sharing outcome is achieved by competitive insurance markets insofar as risk classes are 
correctly identified by insurers; indeed, free entry and perfect competition drive economic 
profit to zero i.e., to actuarially fair premiums p = Be. 

Cream Skimming 

As soon as the different types of agents cannot be distinguished, the insurance market 
breaks down; more precisely, there cannot exist a pooling equilibrium where the same 
contract y = [p, d) is bought by all types of agents. The underlying reason is that the 
average insurance cost, E[0], stands between the real values Oi and Oh meaning that 
safe customers generate profits that can be used to cover the losses generated by unsafe 
customers. 

Cream skimming is the process whereby a competing insurer succeeds to attract prof- 
itable safe customers and woo away the unprofitable unsafe ones. The trick is to offer 
a deductible increase (Ad < 0) i.e., force customers to assume more risk, together with 
a premium reduction (Ap < 0) that compensate safe risk customers only. To prove this 
claim, we compute 

AU(6) = du'[d-c-p)Ad-[du'{d-c-p) + (l-e)u'(-p)]Ap 

1-6 u'(-p) 

oc Ad-Ap -Ap 

8 u'{d-c-p) 



AU(d) Ap 

oc 1 

Ad Ad 



l-O u'{-p) 

1 + -. — — 

9 u'[d-c- p) 



(21.26) 



and observe that AU{6) < for a small ratio ^ while it is positive for a large ratio -^. 
Hence we may adjust -^ so that AUWi) - holds. Now, observe that the last term in 
the bracket decreases with the risk index 6, thus AU{6h) < i.e., the conditions for cream 
skimming hold true. The change in the contract leaves a low risk indifferent while re- 
pelling a high risk. 

For the original y to be a candidate equilibrium, it must not generate losses, thus 



p - E[d]d > so that p - 8 id > 0. We can now choose Ad small enough to guarantee that 
the insurer offering the altered contract earns the positive payoff p + Ap-6i(d + Ad) = p- 
B\d + 1 -£j - 6 1 \Ad > (whatever the sign of -^ - 6i since Ad is small). We have thus shown 
that cream-skimming is at work with this alteration of the original pooling contract that 
simultaneously attract low risks, repel high risks and generate profits. 

Separation of Risks 

The equilibrium is thus separating i.e., insurers offers menus of contracts and different 
risks elicit different contracts that identifies them afterwards. Since in our simple set- 
ting there are only two classes of risk or types, insurers need only offer two contracts 
Yi = (pi,di) and y h = iph,dh) that will be picked up by the safe and unsafe customers re- 
spectively (cf revelation principle seen in §21.2.1). Technically those incentive compat- 
ibility (IC) conditions are £/(#/, y z ) > U{0i,y h ) and U{6h,Jh) > U{0h,j0- Furthermore com- 
petition among insurers guarantees zero per-capita profits i.e., pi = Qidi and ph = Oudh- 
The unsafe type IC condition U{dh,Jh) - U{6h,Ti) reads 

f{d h ) = Q h u{{l-6 h )d h -c) + {l-B h )u{-d h d h ) 
> g(di) = 6 h u((l-d0di-c) + (l-d h )u(-6[d0 

As u is concave, /' oc u'{-ph -c + dh)- u'{-ph) > thus (21.27) is more easily satisfied 
by increasing dh- This means that the Bertrand type competition among insurers 23@ will 
drive df, to c i.e., unsafe types receive full insurance and obtain a utility /(c) = u{-9hC). 
Likewise competition tends to increase d\ but (21.27) is violated at di = c since everybody 
prefers the low premium pi when both contracts give full insurance. Therefore the safe 
risk will have to support some risk {d\ < c). Since 6i<6h^> OhiX - 0i) > (1 - Qh)6i, we have 

g'(di) = e h a-Oi)u'ai-Oi)di-c)-a-o h )6iu'(-9idi) 

> a-e h )6i[u'{a-0i)di-d)-u\-6idi)] >o 

Having proved that g is increasing, there exists d* < c such that g{d*) = u(-dhC) so that 
(21.27) is satisfied for any reimbursement d\ < d* . The benchmark reimbursement d* 
leaves the unsafe risk indifferent between revelation and imitation. Once again compe- 
tition among insurers drives rf/ to its upper limit d* , so that y z = {6id*,d*). Lastly, we 
must check the incentive condition for the safe type: C/(0/,y h ) < [/(#/, y z ) o 

u(-e h c) = g(d*) = e h uui-0i)d*-c) + a-e h )u(-6id*) 

< 6iuUl-6i)d*-c) + (1-0/M-M*) 



is true since the weight on the larger term u{-6[d*) is increased, increases. 

Equilibrium 

What we have characterized is the optimal pair of separating contracts. An immediate 
observation is that a cream-skimming contract is a non-optimal separating contract thus 
it is not a candidate equilibrium. To know whether y l = [did*,d*) and y h = [6hC,c) form 
an equilibrium we must check that there does not exists a pooling contract f preferred 
by both types of risk. 

Observe indeed that an insurer could offer f = {p; c) where the premium is computed 
to make the safe risk indifferent i.e., £/(6>/,y z ) = u{-p). Then the IC condition for the safe 
type tells us that unsafe risks would have a strict benefit in switching from y h to f. The 
expected profit is then p - [XOi + (1 - X)6i l ) c and will be positive if the proportion A of safe 
types in the population is larger than g~_ p g c . The reason behind the existence of this 
profitable deviation is that the proportion of unsafe risk being small, it is not worthwhile 
for insurance companies to seek separation of types because it forces the large majority of 
safe risk to support costly risk; hence any insurer can offer a Pareto improving trade (an 
insurance service) to all consumers even if this means losing money on the few unsafe 
risks that are around. 

So, when A is large, a pooling contract f can successfully attack the separating {Ji'J}) 
but it is itself attacked by a cream skimming contract which is not a candidate separat- 
ing equilibrium. This circularity proves that there is no equilibrium. Still, the model 
of insurance competition should be completed because the reaction of an insurer whose 
customers are stolen by a competitor matters. When attacked by a cream skimming 
contract f, the "old" pooling contract f is withdrawn because the zero profit condition 
p - {ad i + {l-a)6h)d = yield losses once the safe types are gone. But then the cream 
skimming contract f has to serve all types and becomes itself vulnerable to a cream 
skimming attack. The same reasoning applies in case of a pooling attack over a separat- 
ing contract. 

To solve this inconsistency, Wilson (1977) proposes to redesign the competition be- 
tween insurers as a stage game where: 

• Insurers simultaneously offer "old" contracts (pooling or separating). 

• Insurers simultaneously offer "new" contracts. 

• Insurers simultaneously withdraw "old" contracts if they wish to. 

• Customers can sign any contract. 

In this game, the equilibrium is the separating one characterized earlier for small 
A; otherwise it is the pooling contract f giving the highest level of utility to safe types. 



Indeed if A is large, a cream skimming attack against f needs A = 0/ < -^ < P = dh hence 
generates a profit of (A0/ + (1 - A)0/j) Ad-Ap< A(0/ - 0fc)A<i < 0. 



Chapter 22 



Auctions 



Auctions are very competitive trading mechanisms, so much so that the textbook exam- 
ples of "perfectly competitive" markets are often the colorful markets for fish, cattle, wine 
or flowers which all use an auction to allocate commodities. An auction is an organized 
contest (cf §7) that is cheap to set-up and enable the prompt selling of almost any item. 
It also has the crucial ability to extract the precious information that economic agents 
might possess regarding the item for sale; this explains the appearance of this chapter 
in the Part on asymmetric information. Modern references are Klemperer (2003) and 
Milgrom (2004). 

The chapter is organized as follows: we first shed light upon the origin and main uses 
of auctions with an emphasis on the assignment of natural monopolies. We then compare 
the main auctions before inquiring into the optimal auction (for the seller) and efficient 
one (for society). 

22.1 Purpose of Auctions 

22.1.1 Origins 

Auctions are a very useful and old exchange mechanism. There are records of auctions 
for slaves around 1900 BC in Assyria (Irak), for virgin brides and slaves around 500 BC 
in Babylon. In BC Rome, auctions were commonly used to sell real estate, slaves or goods 
as follows: an auctioneer sets a low starting price and waits for participants to signal a 
higher price; they can call out openly or nod in which case it is the auctioneer who sets 
the new (higher) standing price. When no one dares to bid above the standing price, the 
bidder who made it gets the item and pays the seller his bid. As astounding as it may 
sound, the entire Roman empire was once sold in an auction! 

This auction form, which we call Roman to distinguish it from other forms, 1@ has 
been used in Europe, the Middle East and Asia continuously over a thousand year. 2 ® 



Nowadays, the real life auction closest to the open ascending model prevalent in the 
literature is a variant of the Roman auction used in Japanese fish markets. 

In the XX th century, governments have used auctions to sell treasury bills, foreign 
exchange, mineral rights, oil fields, assets of firms to be privatized, public land or prop- 
erties and more recently air waves for television or telephone. In the private markets, 
houses, cars, agricultural production, livestock, art and antiques are commonly sold by 
auction. Even more recent are the internet auctions to sell used items and the business 
to business (B2B) procurement auctions whereby firms compete to sell or buy at bargain 
price their inputs or outputs. 

22.1.2 The case for auctioning 

Perfect Competition 

An auction is the practical trade mechanism that comes closest to perfect competition. 
Standardized goods such as financial assets (stocks, options, derivatives), grain or min- 
erals are traded in exchanges (trading posts, clearinghouses) where anonymous buyers 
meet anonymous sellers in a double auction (cf §22.2.1). 3@ The remarkable property 
of such markets is to extract the private information of participants. Since nobody has 
market power in these large markets, the optimal pricing strategy is to be "price-taker". 
As a consequence, individual demand equates (market) price and (personal) marginal 
willingness to pay; symmetrically, individual supply equates price and marginal cost. 
The bidding behavior of all participants therefore reveal perfectly all their economically 
relevant information and it is aggregated in the equilibrium price. We obtain the first 
welfare theorem according to which efficiency is reached in a competitive market. 

Revelation Mechanism 

It would seem that the previous mechanism fails to work properly to sell a unique item 
such as a painting, a house, a mineral field or a mobile phone license. Indeed, being so 
exceptional or unique, the item does not have a well known market value so that both 
sellers and buyers are unsure of how much they should ask or bid. 

Organizing an auction is an answer to this problem. The owner needs to advertise the 
event to attract the largest possible number of participants because each potential buyer 
will bring his own small piece of information relative to the item for sale. The seller can 
then hope that during the auctioning process these information bits will be revealed by 
the participants through their bidding behavior (like in a competitive market). This way, 
the seller can sell the item to the person who values it most, thereby maximizing her 



revenue. Similarly, whenever an economic agent wants to buy or procure a service or a 
very specific item like a museum or a railroad line, he can use a procurement auction to 
attract many potential contractors (sellers of the service) and award the production of 
the item to the least demanding candidate in order to minimize spending. 

Natural Monopoly 

Government intervene natural monopoly markets (cf chap. 17) and often deliver one or 
a few licenses to operate. Ideally, this should be done with a view to maximize efficiency 
i.e., look for the firm best able to provide high quality at low cost. 4@ In the past, the al- 
location process was often a "beauty contest" where contenders would propose a detailed 
plan of activities and the government would select that best fitting its needs (cf. Table 
7.2). Such a scheme presents two obvious problems. Firstly, the promises included in 
the plan are hard to check and enforce, thus hard to believe in the first place which ul- 
timately means that few elements can really be used to decide between offers. Secondly, 
this selection process is open to wasteful lobbying and corruption; it is widely believed 
that such methods have seriously limited entry, challenging and innovation (cf. §10 & 
§12), an issue better known as regulatory capture. 

The Chicago school, may be inspired by ancient Greek politics, proposed an alterna- 
tive method immune to rent-seeking: random allocation. Once some lucky person gets 
the license, all interested firms will try to buy it back from him. 5@ The ensuing compe- 
tition should guarantee efficiency since the license will go to the firm with the highest 
WTP. The underlying strong assumption here is the absence of transaction costs i.e., the 
negotiation between parties is neither time nor lawyers consuming. 

This option was tried in the US during the 1980s when the FCC organized lotteries 
to allocate radio and TV waves; it lasted until 1993 and lead to a severe fragmentation 
and to very costly negotiations between telecommunication firms and all sort of arbi- 
trageurs. Experience (and the comparison with Europe) has revealed the existence of 
large transaction costs. It seems quite obvious that the lucky arbitrageur who wins a 
license and the telecommunication firm willing to buy it back have private information; 
none of them knows clearly how much the license is worth for the other. The arbitrageur 
knows nothing about telecommunications and the firm ignores whether the arbitrageur 
already got an offer from a competitor. As we formally show in §22.3.3, this asymmetry 
of information makes bargaining inefficient in the sense that the license is some times 
resold to a firm that does not value it most. 

This failed experiment reinforce the case for devising a wise allocation mechanism 
that extracts the relevant information from the participants and identifies directly the 
highest value for the license. Economists are inclined to believe that auctions have fared 



quite well in this respect as we shall argue in the theory section. 

Collusion 

Bidders to an auction have an incentive to collude (form an illicit cartel) in order to get 
the item at better conditions than if they were competing one against the other. In the 
case of auctions among private economic agents the presence or absence of collusion is 
a matter of surplus distribution among buyers and sellers but when the public power is 
involved, reducing its surplus is akin to increasing public spending which causes ineffi- 
cient distortions in the economy. Antitrust authorities therefore actively fight collusion 
in procurement auctions for public works; the large number of prosecutions and judg- 
ments against groups of firms is proof that collusion is widespread in auctions. 

McAfee and McMillan (1992) show that whenever the members of a cartel are un- 
able to make monetary transfers (because of anti-trust surveillance) the best they can 
do to manipulate an auction is to bid the same amount, a feature commonly observed 
in procurement auctions. If they are able to redistribute the rents of collusion among 
themselves, then they behave as a monopsonist in the original auction and resell the 
item among themselves in a private auction. 

Robinson (1985) argues convincingly that collusion is easier in a second-price auction 
(open or sealed) than in a first-price one. Indeed, for an item of known value 50€, it is 
enough for the designated winner to bid a high value like 100€ and for all other colluders 
to bid a low 7€ ; this way no one will ever cheat. Indeed, with a bid like 101€, the cheater 
would end up paying 100€ and lose 50€ while if he bids 30€, he doesn't win the item but 
hurts the winner who will retaliate in subsequent auctions. Such a deception is much 
more difficult to play in a first-price auction because the designated winner ought to bid 
only slightly more than his mates, say 8€; but then anyone can outbid him at 9€and 
make a profit of 41€ . 

22.2 Comparing Auctions 

In our theoretical presentation, the original owner of the item for sale (she) stands also as 
the auctioneer who organizes and runs the auction. We shall first consider her encounter 
with a single potential buyer (he), before generalizing to several. 

22.2.1 Typology 

Auctions can be classified according to their rules. A one-sided auction sees a unique 
seller or buyer proposing an item to several bidders. Two-sided or double auctions put 



buyers and sellers in contact through the auctioneer. Bids can be secret (sealed) or public 
(open). This last case opens the possibility of competition among bidders by alternating 
bids. We present the most frequent ones and indicate in parenthesis the familiar name 
of each. 

• Open ascending (Roman): in the Japanese electronic version, the price starts low and 

goes up with the clock time. Bidders push a button to enter the auction and release 
it to leave (no reentry permitted) so that the winner is the last holding bidder; he 
pays the standing price. 6@ Examples: art auction houses. 

• Open descending (Dutch): the price starts from a very high level and falls with the 

clock time until someone says mine (pushes a button); the winning bidder gets the 
item and pays the standing price. Examples: Flower sales in the Netherlands (cf 
case study by Kambil and van Heck (1996)) or the Google IPO. 

• First-price sealed (Sealed): Bids submitted secretly in written form to the auctioneer 

who sorts them and award the item to the highest bidder who pays his bid. Exam- 
ples: procurement contest or airwaves licenses. 

• Second-price sealed (Vickrey (1961)): same as before except that the winner pays the 

second-highest bid. Example: stamps sales by mail in the US during the late XIX th . 

• All-pay: every bidder pays his bid and the highest bidder receives the object. Exam- 

ples: any lottery (sweepstakes) or equivalently any match opposing two teams; each 
participant's bid is the effort he produces to win the match. 

• War of Attrition: bidders put repetitively an equal amount (of money time or effort) on 

the table until all but one drop out. The item goes to the last standing bidder (cf. 
§7.4). 7@ 

• Double Auction: participants submit supply and demand bids, then the auctioneer sort 

demand bids in descending order and supply bids in ascending order. Matching 
the two curves yields an equilibrium price at which all feasible transactions are 
executed. 

Drawing on the characteristics of the double auction, Walras (1874) offers a theoret- 
ical description of a competitive market: the auctioneer (called Walrasian to distinguish 
from real ones) cries out a price and waits for demands and supplies. He then lowers 
the price if supply exceeds demand and raises it otherwise. This "tatonnement" pro- 
cess eventually converges to an equilibrium price. Today most electronic markets use an 
automated Walrasian auctioneer that offers an equilibrium price continuously equating 
demand and supply. 

The other crucial element in the study of auctions which justifies its presence in 
this Part is the information setting, more precisely what the seller and the potential 



buyers know regarding the item for sale. In the case of private values, each bidder has a 
privately known willingness to pay for the item, called the value and no one else knows 
it. This applies for a fixed quantity of raw standard material or a license for a mobile 
telephone network because the willingness to pay of a potential buyer is determined by 
the technology where the item will be used as an input. In the common value situation, 
the item for sale has a unique market value which is all that matters for buyers but no 
one knows it exactly e.g., an oil field. Lastly we can mix the previous settings, letting the 
item having an intrinsic objective value but also a subjective one for each bidder e.g., a 
painting which you might appreciate on top of knowing that it also has a market value 
for resale. 

Before delving into theory to compare the 4 standard one-sided auctions, let us recall 
that the obvious advantage of open cry auctions is the speed at which large quantities 
can be sold; this is particularly crucial for fresh food or flowers and explain why Roman 
and Dutch auctions have been used for centuries. 

22.2.2 Standard Auctions 

Some formal comparisons can be made among the 4 standard one-sided auctions. In all 
the sealed auctions a strategy is simply a price (to be written down the porposal) while 
in the open auctions, a strategy is a stopping rule, a limit price that can depend on what 
the player has observed while the standing price was evolving. 

First-Price Auctions: Dutch and Sealed 

In the Dutch auction no information is revealed until some says mine triggering the end 
of the auction, thus a strategy is a single figure just like in a sealed auction. Given 
that in the Dutch and Sealed auctions, the winner pays his bid, these two auctions are 
strategically equivalent; this is true whatever the information context. Hence we might 
see the Dutch auction as open first price. What distinguishes them in the real life relates 
mostly to collusive behavior among bidders and transaction costs, the speed at which it 
is conducted and can be repeated. 

Second-Price Auctions: Roman and Vickrey 

When values are private, the Roman and the Vickrey auctions are also equivalent albeit 
in a weaker sense. In the Roman auction, if values are private and statistically indepen- 
dent then a player learns nothing from the fact that some people are dropping out as the 
price goes up. In fact, it is a dominant strategy (optimal whatever do the other bidders) 



for him to stay until the public price reaches his own value because dropping out before 
is to lose an opportunity to get the item cheap while staying too long is taking the risk of 
buying dear. 

In a Vickrey auction, a strategy is a price v (equal or different from the true value v). 
The optimal strategy for a bidder, say #1, is to play his personal value v whatever the 
behavior of other bidders and whatever the information structure. Indeed, letting u be 
the highest bid made by someone else, say #2, we see in Table 22.1 that telling the truth 
is identical or better than understating and also identical or better than overstating. The 
• cells indicate when lying reduces the payoff of bidder #1. In both cases, the winner is 
the bidder who values most the item but he pays only the second highest valuation, 
thus the Roman auction could be relabeled "open second price" since the Vickrey is the 
sealed-bid second price auction. 



bidder #1 / bidder #2 


u<v 


v < u< V 


v < u< V 


v < u 


understate v = v < v 


win, v - u 


lose, * 


lose, 


lose, 


truth v = v 


win, v - u 


win, v - u 


lose, 


lose, 


overstate v = v > v 


win, v - u 


win, v-u 


win, v- u • 


lose, 



Table 22.1: Gain for bidder #1 



Affiliated information 

The private information of bidders is rarely independent, rather it is often affiliated in 
the sense that when one bidder is optimistic i.e., receive information stating that the 
item for sale is very valuable, it is more likely that other bidders' are also optimistic. In 
this context and assuming risk-neutral bidders, whose signals are drawn from symmetric 
distributions, and whose value functions are symmetric functions of the signals, Milgrom 
and Weber (1982) show that the following ranking in terms of seller's revenue: Roman » 
Vickrey » first-price auctions (Dutch or Sealed). This theory is backed by the prominent 
use of the Roman auction through history and settings. 

The intuition behind this classification is that the surplus of the winning bidder is 
due to her private information. Hence to maximize revenue, the seller looks for an auc- 
tion able to extract the winner's information. When information is affiliated, this effect 
will be stronger the more the price paid depends on others' information. The standard 
auction where the price most depends on all bidders' information is the Roman auction; 
indeed, the winner has seen everybody else drop out and has been able to infer much of 
these events so that his own winning bid reveals a maximum amount of private informa- 
tion. In the Vickrey auction, a similar but weaker phenomenon takes place because the 
price depends only on the second-highest bid. Lastly, in the first-price auction (Dutch or 



Sealed), a player's bid incorporates no information in addition to his own. 

By the same token, if the seller has some private information, he should release it 
to augment the information at the disposal of bidders since this will motivate them to 
bid higher. The general principle stating that expected revenue is raised by linking 
the winner's payment to information that is affiliated with the winner's information, 
is known as the Linkage Principle. In practice, sellers of art or exploration rights pay 
independent experts to assess and reveal the likely value of the item for sale. 

22.3 Optimal Auctions 

22.3.1 Revenue Equivalence 

Vickrey (1961)'s revenue equivalence theorem states that if a fixed number of identical, 
risk-neutral bidders, who each want a single unit, have independent information, and 
bid independently, then all 4 standard auctions yield the same expected revenue to the 
seller. 

Revelation Principle 

We follow Myerson (1981) to prove this important result. The first step is to demonstrate 
Myerson (1979)'s revelation principle: When a person dressed in red participates in the 
auction (or any other selling mechanism with very complex rules) taking a series of 
actions, the seller can record them on a sheet and title it "red strategy"; she can do 
likewise with all the participants dressed in blue, green or any other color. The fact that 
the red dressed man plays the red strategy in equilibrium of the auction game means 
that, for him, it dominates the green strategy; likewise the green strategy dominates the 
red one for the green dressed man. Since the preferences of participants are captured 
by their WTP for the item, any two people with the same WTP will behave identically. 
Thus, the seller will record (possibly) different strategies only for people with different 
WTPs. This imply that we can drop the color labeling system and use instead one based 
on the WTP of participants. 

The second step is to use this principle to characterize a participant's payoff. A 
bidder with WTP v (his private information) will get the item with a probability (p{v) 
that depends on the rules of the auction, the behavior of other participants and obvi- 
ously his own optimal strategy 8@ Likewise, he expects to pay some amount 9 ® t[v), so 
that his expected surplus is u{v) = vcp{v) - t{v). In equilibrium, it does not pay for bid- 
der v to act as if he was v i.e., use the optimal strategy of that person so as to win 
the item with probability cp{v) and pay an expected t{v). Analytically, this reads: 10@ 



u{v) > v(p{v) - t{v) = u{v) + {v- v)(p{v). Since it won't pay either for v to act as if he was v, 

we obtain 

(p{v)> >cp{v) (22.1) 

v- v 

out which we deduce two results. Firstly, 

v>v=>(v-v) {(p{v) - cp[v)) > => (p{v) > (p{u) (22.2) 

By playing optimally in an auction, a bidder with higher WTP guarantees himself a 
greater probability of winning the item i.e., (p > 0. 

The second result is obtained by letting v converge towards v; at the limit u [v) = <p(v) 
must be true. Integrating this equality, we can write 



f 

Jo 



u[v) = u{0)+\ (p{x)dx (22.3) 

Jo 

The interpretation goes as follows: given that the bidder follows an optimal (equilibrium) 
bidding strategy, his final surplus depends only on the probability of winning (and his 
payoff in the worst case). Notice that because q> is increasing, u is convex meaning that 
additional WTP becomes more and more valuable to win the item at a good price and 
earn a maximum surplus. 

Whenever two auctions generate for all bidders the same probabilities of winning and 
the same surplus in the worst case then they generate the same surplus function u{.), 
the same payment function t{.) and finally the same revenue for the seller; this is the 
general form of the revenue equivalence theorem. In the 4 standard auctions, the item 
goes always to the bidder who has the highest value thus the probability of winning is the 
same because it depends only on the statistical distribution of values. Furthermore, a 
bidder drawing the lowest value v cannot win (with positive probability) thus her surplus 
is always so that the theorem applies yielding our initial claim. 

Optimal Bidding 

As we already saw, the optimal bidding strategy in the Vickrey auction with independent 
values is one's own value, a behavior known as truth telling or truthful revelation (of the 
private information). If values are private and independent then the same is true in the 
Roman auction. 

The revenue equivalence enables to compute the optimal bid in first-price auctions 
because expected payments are identical to those of second price auctions. Now, the 
expected payment is the product of the winning probability by the expected price, and 
we know that the winning probability is the same in all 4 standard auctions if values are 



private and independent. In that case, the bid bt(y) in the first-price auction is equal to 
the expected price E[v\v < v] in the second price auction with v being the second highest 
bid. Hence, my optimal bid in a first-price auction is the expected second highest value 
conditional on being lesser than my own. 

Let us compute this for player #n. We first look for the distribution of the random 
variable u, the highest bid of the other n - 1 players. Since all values are drawn form the 
distribution H with density h = H', the probability that player #1 bids in [v;v + dv] is h{v) 
while the probability that the remaining n-2 players bid less than v is H{v) n ~ 2 ; since 
identities do not matter the density of v is g{v) = {n-l)h{v)H{v) n ~ 2 . Using integration by 
parts below, we obtain the optimal bidding strategy as 

which, as expected, is lesser than the value to reflect the fact that players want to make 
a profit in these first-price auctions and are aware of the possibility to win the item 
without bidding their private value. 

To give a practical use to formula (22.4), let us assume that each buyer's value is 
uniformly distributed over [0; v] i.e., H{v) = vl v, then b{v) = v-Jq (~ )"~ dx - v-\ -^^ - 
v-vln. Notice that the optimal bid does not depend on the top value but only on the 
number of contenders. We see here why it is interesting to attract many participants 
in an auction: it motivates all of them to bid more aggressively which raises the seller's 
revenue. 

A phenomenon well known among practitioners is the winner's curse according to 
which the winner of an auction has paid more than the value of the item. Whenever 
some people receive optimistic information while others receive pessimistic information, 
values are not private anymore, they become correlated (common value model). In that 
case, the winning bid, being made by the player who received the highest signal, is 
overtly optimistic and frequently overshoots the true value of the item. In theory, player 
take this phenomenon into account when computing their optimal bid. The key to com- 
pute my optimal bid is to forget about the item's expected value (based on my private 
information) and concentrate on its value when my bid is the highest, hence taking into 
account that other did not dare bid so high. 

22.3.2 Optimal Selling Mechanism 
Intuition 

Bulow and Roberts (1989) introduces this topic with the follow example: Wallace would 



be ready to pay some price between and 10 while Gromit would pay some price between 
10 and 30 for a painting. In a Roman auction, Gromit systematically outbids Wallace and 
pays exactly his value so that on average the seller gains 5. Setting a starting price of 10 
would raise that payoff to 10 because Wallace would never bid and Gromit would wisely 
bid his minimum WTP. An even better idea is to set a starting price of 15. True, not even 
Gromit would participate when his value is less than 15 (probability |) but the average 
revenue would jump to |l5 = ll,25. 11@ The question is then to identify criteria that can 
help us build rules of an optimal auction. 

Observing that the seller's pricing problem is quite similar to that of a standard mo- 
nopolist, one can assimilate the distribution of bidders' values to a demand curve and 
compute a marginal revenue curve. An optimal (revenue maximizing) auction is then 
one where the item goes to the bidder whose marginal revenue is greatest; this includes 
the seller herself if no marginal revenue surpass her reservation value v . This scheme 
is implemented by a modified second-price auction where each participant bids a value 
(openly or secretly), then the seller then computes the marginal revenues and gives the 
item to the bidder with the greatest marginal revenue. This auction is incentive compat- 
ible in the sense that participants truthfully reveal their value. 

The previous analysis of an optimal auction (for the seller) makes the search for an 
efficient auction (socially optimal) quite simple. We only need to give the item to the 
bidder with the highest valuation; this includes the seller herself if no bidder value 
surpass vq. 

To achieve this, we can use the Vickrey auction with reserve price vq since we already 
proved that bidders reveal their true WTP in that auction. By the revenue equivalence 
theorem, the 4 standard auctions with reserve price v are efficient. This result looks as 
a promising allocation mechanism but a word of caution is necessary when dealing with 
the auctioning of governmental licenses. As we explain in §12.2.3, an incumbent operator 
(already owning a license) has more to lose from entry than the challenger can hope to 
win after entry, thus the incumbent will strategically raise his bid for a second license to 
outbid the challenger in order to block his entry. This could lead to an inefficient outcome 
if the challenger had a better technology or simply because a monopoly is maintained, 
instead of evolving towards a more competitive duopoly market structure. 

To compare the previous optimal selling mechanism and the efficient one, we observe 
that since the marginal revenues of the potential buyers are all increasing functions, the 
ranking of values yields the same ranking of marginal revenues so that the object goes 
to the same person whenever it is sold. The only time where efficiency fails is when the 
monopolist computed reserve price p M is greater than his true valuation v because in 
that case she keeps the item even though someone may have a higher valuation. 



Discriminating Monopoly Analogy 

We now develop the previous intuition to uncover the optimal selling mechanism. If the 
seller was an omniscient druidess, she would be able to rank the WTP of participants 
V\>v 2 > — > v n and ask bidder #1 to pay V\ in a "take-or-leave-it" manner i.e., act like a 
perfectly discriminating monopoly (cf §4.2.1), rip a maximum revenue and perform an 
efficiency enhancing trade since the item would go to the economic agent best able to use 
it (unless her own valuation v is greater so that efficiency commands her to keep the 
object). In real life situations, the WTP of participants is a private information to each 
of them. Maximizing revenue is therefore akin to maximizing information revelation. 
We adopt Bulow and Roberts (1989)'s heuristic analogy of auctions with monopoly price 
discrimination as treated in §3.3.1. 

The seller's dilemma is almost identical to that of a monopoly facing a market demand 
because facing one buyer whose WTP you ignore is formally identical to facing a large 
population of indistinguishable buyers. 12@ Indeed, a typical demand curve D{.) like that 
displayed on Figure 2.4 can be seen as a series of n = D(0) people ready to buy a single 
unit of the good, ranked by their WTP. At the price p, monopoly sales are the fraction 
D(p)/D{0) of the market size i.e., there is a proportion H{p) = l-D{p)/D(0) of buyers whose 
WTP is lesser than p. An alternative way to read this is the following: if a potential 
buyer is picked at random from the population and offered the item for the price p, he 
will accept with probability D{p)/D{0) = 1 - H{p). 

Now, an auctioneer does not pick a buyer, rather a buyer presents himself at the 
auction and although she ignores how much he would be ready to pay for the item, she 
knows his population of origin. The latter is completely described by the distribution 
function H(.) which we assume known to the auctioneer, just like we always assume that 
a monopolist knows the demand function D{.) without knowing what it is made of. 

Marginal Revenue f 

Consider the encounter between the seller of the item and a potential buyer drawn from 
a population whose statistical distribution is H. When the seller offers the item for the 
price p, the probability of a greater buyer WTP is 1 - H{p) which is also the probability 
of a sale; this amount plays the role of quantity q in the standard monopoly analysis as 
illustrated on the left panel of Figure 22.1. From an ex-ante point of view, the expected 
revenue is R(p) = p x (l- H{p)) which can be written R{q) = qH x {\ - q) using the p — q 
change of variable. We can then compute the marginal revenue R m {q) = H~ l {l-q)- H ,% ■. 



and using once again the change of variable q into p, we derive 

Rmip) =Rm{l- H{p)) = p - - - F - < p 

Hp) 



(22.5) 



which is plotted 13@ on the right panel of Figure 22.1 for the uniform distribution over 
[0; v\. We now use the fact that R{q) = f^ R m {x)dx to express 

R(p) = p{l-H(p)) = R{l-H(p))= [ R m (v)dH(v)^p = E[R m (v)\D>p] (22.6) 

meaning that the price is the expected marginal revenue conditional on the bidder's 
value being larger than this price. 




q q 



Figure 22.1: Virtual Utility 



Optimal Bilateral Sale f 

We devise here the optimal mechanism to sell one item to someone whose WTP is drawn 
form the distribution H. The seller's opportunity cost is v . The expected surplus of the 
potential buyer, E[u], is computed from (22.3): 



E[u] 
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= u(0)+ f <p(v)^fdH(v) 



(22.7) 



by an exchange of integration order. 

A bidder with WTP v is willing to take part in the auction if u{v) > and this con- 
straint will be always satisfied if it is satisfied for the lowest possible WTP i.e., the only 
participation constraint that matters is u{0) > 0. Since revenue is t{v) = vq>{v) - u{v), we 
can use (22.7) to check that the seller earns on expectation: 



E[f] 



/ \ l ^^ )y^ dH ^)-M(0) = E[y(g)^m(g)]-M(0) (22.8) 



Taking into account her opportunity cost Vq of renouncing to the item, her (producer) 
surplus 14 ® is 

W s = E [(p(v)R m (v) + (1 - (p{v)) v ] - u(0) = i; - u(0) + E [<p{v) {R m (v) - v )] (22.9) 

so that its maximization leads her to sell the item i.e., set cp{v) = 1 only for values v 
such that R m {v) > vq. Letting q M solve R m {q) = v and p M solve 1 - H{p) = q M , we check 
on the left panel on Figure 22.1 that this optimal rule generates sales to a proportion 
q M of potential buyers which is exactly the standard monopoly quantity given that the 
opportunity cost v o is the marginal cost of "producing" the item. Furthermore, the rule 
satisfies the condition cp' > since ip is nil over [0;p M ] and unitary over [p M ; v\. 

To implement this outcome it is enough to behave as a standard monopoly i.e., ask 
a price p M = H _1 (l - q M ) for the item or to set-up the following auction: the buyer bids 
a WTP v and is allowed to buy the item at the price p M if the marginal revenue cor- 
responding to his offer, R m {v), is greater than vq. It is a simple exercise using the left 
panel of Figure 22.1 to check that a buyer with WTP v has no interest to lie i.e., he will 
truthfully reveal v = v (the proof for the general case is provided in the next paragraph). 

When comparing the two previous mechanisms, the second looks dumb since it adds 
a bidding stage that seems irrelevant. That is correct in the present setting but once 
there are several bidders, it becomes a useful device to force a maximum revelation of 
information. 

Optimal Auction f 

We derive here the optimal auction for the seller and the efficient one which are close. 

Consider n independent but not necessarily identical bidders i.e., bidder #j"s value vt 
has statistical distribution is Hi with density hi. We count the seller as a dummy bidder 
#0 whose value distribution is entirely concentrated at v and set t Q = u = 0. 

When participating in the auction, each bidder will bid optimally so that the previous 
results will apply. We denote (p t {v) the probability that bidder #i wins the object when 



the vector of values is v. Thanks to the dummy bidder trick, the seller's revenue is also 
her producer surplus 

W s = E[Y. i > (p i (v)R m , i (v i )} -Li> Ui(0) (22.10) 

where R m j is computed as in (22.5) but using the distribution function H t . 

An optimal (revenue maximizing) auction is now easy to identify: for every vector 
v, the item should go to the bidder whose marginal revenue Rmjivt) is greatest; this 
includes the seller herself if no marginal revenue surpass ^o- 15@ It remains to build 
an auction where this outcome is implemented for each possible combination of private 
values. 

Consider the modified second-price auction where each participant bids a value v\ 
(openly or secretly), the seller then computes the marginal revenues r, = Rmjivt) for i > 1 
and ro = v . Assume for simplicity that after ranking these marginal revenues we have 
r x >r 2 > ... > r n ; let then q 2 be defined by R mt \{r 2 ) = q 2 . The object is awarded to bidder #1 
at the price p 2 = Hf*(l - q 2 ) as shown on Figure 22.2. Notice that since the price is always 
positive, a bidder with minimum WTP never wins it, thus derives a nil utility so that 
ud0)=0. 
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Figure 22.2: Optimal Auction Design 



We claim that everybody has an incentive to announce truthfully his value in this 
auction. There are two cases to consider; either you are a winner, say bidder #1 with 
the truthful bid v \ or a strategic bid. In the first case, the price paid is a function of r 2 
not r\. As can be seen on Figure 22.2, overstating V\ by announcing v\ > V\ has no effect 
whatsoever. Understating v i by announcing v\ < v x will have no effect either meanwhile 
v\ > p 2 but if the understatement is too strong, the item will go to someone else since the 
computed marginal revenue f\ is now inferior to r 2 ; this represents a loss since telling 



the truth leaves a net surplus V\-pz> 0. Now, for a bidder, say #3, who does not win 
when truthfully bidding v 3 , there is nothing to do. Indeed, the only way to win the object 
is to outbid the current winner (#1) with some bid v 3 that has to be greater than p 3 the 
price computed using Hs and the winning marginal revenue r\ which itself is larger than 
the true valuation v% as can be seen on the right panel of Figure 22.2. 16@ 

When the auctioneer has absolutely no discriminating information regarding the par- 
ticipants, the distribution functions are identical. This implies that the highest value 
translate into the highest marginal revenue, thus our optimal auction becomes the Vick- 
rey auction with an optimal reserve price. By the revenue equivalence theorem, the 4 
standard auctions are optimal. This result is correct for private as well as common value 
settings. 

An efficient auction is one that maximizes welfare W = E[£;>o Vi(p t {\)\. Thus, it is 
enough to award the object to the highest WTP, including the seller as a dummy bidder 
with value v Q . 

22.3.3 Bilateral Trade under Uncertainty f 

Informed Trade 

When two people are interested into trading an item, it is enough that the value b for 
the potential buyer be larger than the value s to the current owner (and potential seller); 
they might agree on the price ^±2 if they have equal bargaining abilities (cf §2.4.3). 

This simple analysis underlies the case for "free trade" or (economic) "liberalism": 
whenever something in the economy is owned by someone, say a man, unable to make 
good use of it then someone else, say a woman, will offer to buy the item. The rationale 
behind this offer is that she thinks she's able to create more market value from the item, 
hence she is ready to pledge enough money to buy it because later on she will be able 
to recoup the investment. The immediate conclusion is that initial ownership does not 
matter, only ability does. 17@ 

This story runs into a difficulty as soon as the valuations b and s are not public 
knowledge. If each agent only knows his or her own valuation then bargaining over 
the exchange price is more difficult and can sometimes fail to implement the efficient 
outcome i.e., the item might be exchanged when it should not be and conversely, it might 
fail to be traded when although it would have been desirable. 



Information Revelation 

The Vickrey (1961)-Clarke (1971)-Groves (1973) mechanism (VCG) provides a solution 
to this problem and more general ones; it involves a broker and the two agents. Each 
agent announces his value, b for the buyer and s for the seller. Trade occurs if b > s in 
which case the buyer has to pay s to the broker who pays b to the seller. Exactly as in 
the case shown in Table 22.1, no one has an interest to lie because the payment does 
not depend on one's own announcement and lying can only bring inefficiency in trade. 
The item is therefore traded exactly when it is efficient to do so (iff b > s); yet the broker 
exactly loses b-s, the gain from trade. Hence, efficiency can be reached but it requires 
some prior funding by the parties to make the broker willing to play his part. 

To see if there is a way around this deficit issue, we assume the individual values 
b and s are drawn from a continuous distributions Fb and F s with support [v; v] for v = 
b,s. Both parties know the statistical information regarding their partner' value but 
do not observe its actual realization. If b > s then it is always efficient to sell the item 
and partners know perfectly this fact: although the buyer ignores the exact value v , he 
knows that b > v for sure and likewise, the seller who ignores the true b knows that 
b > v. Somehow, the asymmetry of information does not matter; a balanced and efficient 
mechanism is a sales contract stipulating any fixed price p e [s;b]. Since b > p and p> s 
for sure, it is always in the interest of both the seller and the buyer to sign this contract, 
whatever their own valuation and without worrying for the actual value of the partner. 
The price will be negotiated ex-ante using the expected values of b and s; for instance, 
the fair division price is p = E[s + b] . 

Problematic Trade 

The more realistic but more problematic case is that where ownership swapping is not 
always the efficient decision (b< s). Chatterjee and Samuelson (1982) (CS) consider the 
following simple exchange mechanism whereby the seller asks a price p s , the buyer offers 
a price pb and exchange takes place for the price Ps+ 2 Pb if and only if p s < p b . 

These authors show (cf proof) that the optimal strategies are to announce the non- 
truthful prices pUb) = \b + ^t an ^ P*s^ = I s + ~nT- I* 1 equilibrium, trade occurs at price 
Ps ~* 2 Pb - S+ 6 +S+ ~ , if and only if pb < p s o b- s > ^= which is inefficiently rare as soon as 
b < s; the fact that trade might be inefficient (socially undesirable) distorts the optimal 
bidding strategies away from truthful revelation which is the only way to be always 
efficient in trade. 

Let us now analyze the optimal bidding strategies. The seller can ask a very high 
price and make sure he keeps the item, thus he can guarantee himself his private val- 



uation s. One would think that asking more than the valuation (p s {s) > s) is an even 
better strategy, since trade takes place only if pb is larger so that the actual price Ps+ 2 Pb 
is mechanically greater than the valuation. The optimal strategy does not follow this too 
simple intuition; indeed, b < s, which is true, implies p* (s) < s i.e., the seller strangely 
understates his valuation when the latter is very large. The reason is that trade will 
actually take place only if b > s + ^= and in that case the price will be greater than — j^= 
i.e., for some unfrequent values of b, the seller will make a loss but this will be more 
than compensated by the increased frequency of profitable trades. The same observa- 
tions apply in a completely symmetric manner to the buyer. The crux of the problem 
here is whether trade is efficient or not is to be discovered by the parties, hence 

I The optimal strategies must not only concentrate on making a gain from trade but 
also on making trade happen. 

Limiting Efficiency Losses 

Generalizing this study, Myerson and Satterthwaite (1983) (MS) show that a bilateral 
trading mechanism cannot be at the same time efficient in trade, balanced in budget and 
guarantee the participation of both agents i.e., there is no way to eliminate the deficit 
issue in the VCG mechanism without introducing some inefficiency in trade (cf proof). 
The intuition uses our previous results. Suppose the broker tries to devise a trading 
mechanism that is always efficient and the least costly to him. The efficiency will be 
satisfied only if both the seller and the buyer reveal their private information to the 
broker. The latter then tries to buy cheap on one side and sell dear on the other. Provided 
that the broker has learned s, the buyer should get the item as soon as his own value 
is greater. As a consequence, the price paid is exactly s for otherwise some socially 
beneficial trade opportunities would be lost. In that case, the buyer receives all gains 
from trade. By an exact symmetry, the seller will also receive all gains from trade from 
the broker. The latter can do not better than loose the expected gains from trade. 

MS further demonstrate that the CS trade mechanism maximizes the gains of trade 
among budget -balanced mechanism guaranteeing participation of both agents (cf. proof). 
The previous result was akin to show that a natural monopoly forced to price at marginal 
cost would incur losses while this new result is akin to show that average cost pricing is 
the least damaging deviation from efficiency that avoids losses. As noticed by Bulow and 
Roberts (1989), the latter is a Ramsey-Boiteux problem. 

To fix ideas, assume b=s = 0, b = s = 1 and that values are uniformly distributed. 
In the first-best world (efficient trade with a sponsor broker), the gains of trade are 
Jo Jo (b- s)dsdb = |. In the second best world where trade occurs only if b- s > 1/4, the 



gains of trade are only f^ /£ lli (b - s) dsdb =B,a loss of S^ « 19%. 
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Chapter 23 
Entrepreneurship 



According to Schumpeter (1942), capitalism is a dynamic evolutionary process coming 
from within the economic system. It does not develop by adapting to exogenous changes 
but by mutating in a discontinuous fashion, succumbing to revolutions which displace old 
equilibria and structures to create radically new ones. For instance, the factory wiped out 
the workshop, the car superseded the horse and buggy, and the corporation overthrew 
the proprietorship. This process of creative destruction is the essence of capitalism. 

The key actor in this enduring vision is the entrepreneur whose characteristic trait 
is innovation, the ability to see external change as an opportunity, not a threat and 
therefore to do new things or perform old ones differently. 1@ This novelty often stems 
from invention or research but not necessarily given that its driving force is economic 
profit rather than intellectual satisfaction. The entrepreneur needs control over the 
means of production to "get things done"; in that respect, ownership is helpful but hardly 
necessary. The innovation can be the introduction of a new item (good or service), a new 
technology to produce an old item, a new commercial strategy (new market, new source of 
supply), a new internal organization or a new market structure such as monopolization. 

The reason why entrepreneurship is so important for government policy is because 
technical change and innovation explains much of the steady growth in advanced economies 
since the industrial revolution. 2 ® Most of the academic interest towards entrepreneur- 
ship is found in the finance and business literature although many important contribu- 
tions use the standard toolbox of information economics. 

In this chapter, we propose a self encompassing introduction to moral hazard and ad- 
verse selection within a single framework, the bilateral agency relationship between an 
entrepreneur (agent) and an investor (principal). The former, aka the agent, is innova- 
tive but penniless. He is thus forced to seek external capital from a wealthy but clueless 
financier to start up his project. The first section will look at the inefficiencies created 
by equity finance while the next one concentrates on debt. Our findings apply word for 
word to just about any firm inso far as managers can be assumed to perfectly represent 



the owners. The last section is thus devoted to managerial incentives. 

23.1 Agency Cost of Equity Finance 

In this section, we show that outside equity can be a source of moral hazard and adverse 
selection. The first segment sets the entrepreneur's playing field and relates the finan- 
cial perspective to the economics one. We then enter the heart of the matter to show the 
sharing of future profits through the emission of equity demotivates the entrepreneur 
and rationally lead him to under-invest. In the next segment, we explain why going to 
the equity market always conveys bad news about the quality of the firm. As a conse- 
quence, less money is raised and more generally, good firms are underpriced; this is an 
instance of adverse selection. Lastly, we study how the retention of a large fraction of 
the equity by a risk averse entrepreneur may act as a signal of quality to counter the 
previous "bad news" effect. 

23.1.1 Background 

Finance vs. Economics 

The neoclassical theory of the firm used in microeconomics focuses on decisions directly 
relevant to market competition and tend to treat profit as a function of production and/or 
price. The emphasis is thus more on revenue than cost; there is also a neglect of the fact 
that costs are disbursed before revenues accrue, thereby creating a need for liquidity. 
Corporate finance, as its name indicates, aims at solving this later problem for firms (cor- 
porations) while keeping in mind that the ultimate objective remains the maximization 
of profits. This amounts to invert priorities and focus on expenditure (aka. investments), 
treating revenue (aka. cash-flow) as a consequence. Corporate finance therefore studies 
what projects to undertake and how to finance them. In this paragraph we show that 
those views are dual or the two facets of the same objective, profit maximization. 

From a financial point of view, a firm's technology is an ability to implement projects 
i.e., perform investments into assets and later receive returns. We therefore identify 
a firm with a list of projects opportunities {Cj,rj)j< n where Cj is the investment cost of 
project 3 ® #j and r/ its rate of return (i.e., the return is c/(l + r/)). As illustrated on Figure 
23.1, one can rank the potential projects by decreasing rate of return. The total cost and 
total revenue of the first i best projects are respectively kt = T,j<t Cj and Rt = Ej< ; - c/(l + Tj) 
which satisfy 1 + r t = fEfr 1 - 

If we pass to the continuum by considering many small size projects, then R becomes 
a function of A; and this revenue-cost relation R{k) has derivative 1 + r{k). If the risk-free 
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Figure 23.1: Investment Projects and Profitability 



interest rate is r then the profit function expressed in net present value 4 ® is 



n(k) 



R(k) 
l + r 



-k 



(23.1) 



so that the optimal level of investment A;* maximizing profit solves r{k) = r . A compar- 
ative static exercise, useful later on, is to find out how A;* adjusts to changes in r . If 
the risk-free rate is initially r$ (cf Fig. 23.1), then the optimal investment is k±. If the 
risk- free rate drops to r 5 , the optimal investment increases up to k$ while if the rate goes 
up to r 3 then investment recesses down to /c 3 . 

It is now easy to relate the financial vision of profits to the economic one. Denoting 
p the market price of the commodity produced by the firm, the ratio of the net present 
value (NPV) of revenue to price q = J^ > can be interpreted as production. Inverting 
this relation yields the neoclassical cost function k = C(q) so that the profit is now 



n{q) = pq-C{q) 



(23.2) 



The optimal production is that solving p = C m {q) - p ^ + ^ - \^$: which, not surpris- 
ingly, leads to solve r{k) = r . Since r(k) is a decreasing function of A;, it is everywhere 
lesser than the average return, thus at the optimum, the firm's average return is strictly 
greater than the risk-free one r which means that the firm is making extraordinary prof- 
its. Such a situation characterizes a short-term equilibrium since in the long run, free 
entry enables competitors to claim some of these "high returns" projects and therefore 
reduces the overall profitability of the firm. 



Efficient Finance 

We study the contractual relationship of an entrepreneur (she) and an investor (he). 
The former has a limited personal wealth and owns an unalienable human capital sum- 
marized by a cash generating technology as discussed above. By its very nature, the 
entrepreneur's knowledge cannot be sold so that the investment decision can only be 
taken by her. 

To simplify notations, we denote R the present value of future cash-flow so that the 
project NPVis simply n = R{k)-k (as if the risk-free rate r was nil in (23.1)). As explained 
above, the technology has decreasing returns to scale (R m > 0,\) so that the efficient 
investment maximizing the NPV is k* solving the first order condition R m = 1 i.e., one 
should equate the productive value of l€ to its opportunity cost evaluated at market 
conditions. 5@ 

If the entrepreneur was rich enough, she would be able to afford the efficient invest- 
ment out of her initial wealth and efficiency would be reached (one speaks of a "first-best" 
situation). Nonetheless, the realistic case is precisely the opposite one and to simplify 
further we simply assume a zero initial wealth. The two basic ways to raise capital are 
debt where you promise repayment to a lender with an interest or emission of new equity 
where you agree to share future profits with someone else. The corresponding financial 
instruments are bonds and shares. The fundamental difference between them is the se- 
niority of debt over equity in case of bankruptcy (inability to meet financial obligations). 

23.1.2 Incentives to Under-invest 

Jensen and Meckling (1976) show that recurring to outside equity has an agency cost be- 
cause it leaves the entrepreneur with partial residual claimancy while the cost of putting 
time, effort or personal wealth into the firm remains the same. Thus, she is left with 
partial incentives toward investment. Indeed, if at the margin, 10€ invested by the en- 
trepreneur into the firm yields in return 15€ then it ought to be invested; now if the 
entrepreneur gets only 50% of it because she sold half of the firm to an outsider, she shall 
rationally forgo this investment. In a nutshell, "there is no way to make more than one 
person the residual claimant of an economic activity"; for that reason, incentives toward 
effort or investment are diminished for most members and under-investment occurs (cf 
also Holmstrom (1982a) on moral hazard in teams). 

To prove this result, we use Figure 23.2 where the NPV of the project is plotted as 
a function of total investment k. We denote k = k* the efficient investment and n = 
R{k ) - k the maximum NPV. To finance her project the entrepreneur sells a% of the 
project's future cash-flow to an investor who in return pledges an amount F. Even if these 



shares have voting power, the technology is uniquely controlled by the entrepreneur so 
that she remains the only one able to decide on the investment level. Her profit 6 ® is 
u a {k) = (1 - a)R{k) - k + F for any investment satisfying the financing constraint k < F. 
Neglecting this constraint for the moment (as if F was large), profit maximization leads 
her to solve R m = -^ > 1. As a result, she under-invests at a level k a < A; (recall the 
comparative static exercise of Figure 23.1). As can be grasped on Figure 23.2, the final 
NPV of the project, n a = R{k a ) - k a is lesser than the maximum n obtained with the 
efficient investment and furthermore, the larger the share sold to the outsider, the worse 
the resulting inefficiency 7 ® The intuition has already been given: since the entrepreneur 
has to share the profits of her firm with the investor, the marginal benefit from investing 
her personal wealth is lowered to (1 - a)R m while the opportunity cost of money remains 
1, thus the entrepreneur is demotivated and lead to under-invest. This distortion can 
be called an agency cost because the entrepreneur serves as an agent for the outside 
investor. 

A net return 



R(k) - k 




investment 



Figure 23.2: Equity Financing 

To close the study we only need to take care of the investor; he pledges F only if 
a is large enough but this leads to severe under-investment and a serious reduction 
of the cash flow generated by the project, so much that it might not cover the initial 
funding. Formally, the participation constraint for the investor is F < aR{k a ); it involves 
not just any value of A; but k a because the investor anticipates the future choice that 
will be undertaken upon his acceptance of a shares. Combining this condition with the 
financing constraint k a < F, we obtain a necessary condition 



k a < aR{k a ) 



a> cp{a) 



R(k a ) 



i.e., the offered share must be large enough. This constraint is obviously violated at a = 



since the entrepreneur is seeking funds but it is satisfied for a = 1 because a minimal 
investment is still worthwhile. 8@ Observing now that the ratio (p is decreasing 9 ® with 
a, the constraint is satisfied with equality for some < a < 1: this is the minimal share 
the entrepreneur must relinquish to get the capital necessary to undertake the invest- 
ment that will then be optimal given the shares she kept. Yet, she won't offer a greater 
portion because she realizes that the cake to be shared with the investor, the NPV n a , is 
decreasing with a and there is no way for her to increase her absolute share of it. 10@ 

Notice that in equilibrium, investors just break even, thus it is the entrepreneur 
who bears all the (agency) cost of his own inefficient behavior due to her inability to 
commit today to perform in a given way tomorrow. If she could credibly commit to the 
efficient investment k Q , investors knowing they will get a share of R{k ) would accept to 
pay F = aR{k ) for a% of the business. It would then remain to choose a = ^^ to satisfy 
the financing constraint. The crux of the problem lies obviously in the credibility of the 
announcement "I will invest fco", especially if it incorporates subjective elements such 
as human capital (the part of her working time really devoted to make the project a 
success). 

23.1.3 Equity Underpricing 

Myers and Majluf (1984) deal with the known difficulty of firms to raise capital when 
they have private information about their current profitability. Formally, this is a prob- 
lem of adverse selection similar to Akerlof (1970)'s lemons: if a firm issues equity to raise 
capital for an investment, it accepts to share future cash-flow with the new shareholders, 
a cash flow that comes from the new investment but also from the current assets. If the 
latter are very valuable, they will yield a lot of cash, so it not a good idea to share with 
foreigners; it is better to fund the new project with debt and emit zero additional equity. 
Potential investors understanding this logic, rightly suspect that an equity issue is a 
rather bad signal regarding the value of current assets; if the latter are not worth much, 
investors won't pay much for the firm's new equity either. If, contrary to this belief, the 
firm has really valuable assets, its equity will be underpriced, a phenomenon frequently 
observed in financial markets. 

To see formally this phenomenon suppose that a firm owns some capital assets that 
will yield tomorrow a cash-flow x > that is random from the point of view of investors; 
that is to say, the market faces a large population of firms whose future cash-flow x has 
a statistical distribution H. We study the consequence on the quotation of the current 
shares of going to the equity market to finance a new project whose lumpy cost is A; and 
whose positive return rate is r. 11@ 



The current true value of the firm is x, the cash-flow of current assets while its current 
market valuation is E[x], the expectation made by investors according to the distribution 
H. If the firm renounces to the investment when it was publicly known it had the oppor- 
tunity to undertake it (event no), the value of the firm remains x but the market value 
of equity becomes V no = E[x|no] because investors update their beliefs given the informa- 
tion just revealed by the "not going" decision. Likewise, if the investment is announced 
together with the emission of new equity (event go), the market value of actual equity 
changes to V g0 = E[x|go] + rk. The true value of the firm is now x + (1 + r)k but the value 
of old equity is only a fraction -^2- f it since it was diluted by the new emission that 
raised the amount k needed to undertake the new project. 

Comparing the two decisions and their payoffs, we see that the original owners decide 
to launch the new project only if the value they will retain tomorrow is greater than 
today's i.e., the go event is characterized by 

x< — — -(x + (l + r)k) o x<{l + r)V go (23.3) 

V g0 + k 

i.e., current assets are not very valuable. As intuition told us, it is not a good idea 
to sell valuable assets (large realized x) and indeed, the firm will rationally pass the 
opportunity to launch the new project. Since the no event is characterized by x > {l + r)V go , 
we can check 12@ that 

V no = E[x\x > (1 + r)V g0 ] >(1 + r)V go > V go (23.4) 

i.e., the decision to go is "bad news" and is chastised by the market. This explains why 
stock price falls when a firm announces a new emission of equity. 

Beyond the under-pricing issue which is a consequence of the revelation of private 
information by the firm to the capital market, we have an efficiency problem: a positive 
NPV project will not always be implemented because the no event has a positive proba- 
bility. If the firm could sell its old assets it would credibly reveal to the market the value 
of x; then it would not suffer under-pricing and efficiency would be restored since the 
new investment would always occur. Indeed, the fair value of the actual shares would be 
V go = E [xlgoU-x] + rk = x + rk since the market information contains the decision to go and 
the true value of the firm's current assets. Now, the condition x < (1 + r)V go is always true 
for r > i.e., every new project of positive NPV is undertaken. The value of the firm after 
the "go" decision would increase from x to x+rk because this time, the market rewards 
the decision to go on with the new project. 

One could nevertheless wonder why the decision to issue equity could not be the 
"good" signal that the firm has encountered a new project of positive NPV? The reason 
has to do with the possibility that the rate of return r is itself uncertain. The firm will 



never undertake an investment with r < because the capital would be better invested 
in riskless bonds. Yet for r = 0, the firm will go on whenever x < V g0 , so that the equity 
issue could still be bad news. 

The conclusion generally drawn from the previous model is called the "pecking order" 
theory stating that a firm should finance its projects internally before recurring to debt; 
equity should be used as a last resort only. 

23.1.4 Signaling Quality 

We leave aside the value of current assets and inquire the pricing of new assets (new 
projects). When the market is unable to distinguish the good projects from bad ones, the 
equity of good firms end up being underpriced which in turn might lead these firms to 
under-invest. Leland and Pyle (1977), in a model replicating Spence (1973) to the current 
setting, show that the entrepreneur can alleviate this information problem by retaining 
a large fraction of the firm's equity to convince the market of its intrinsic quality. This 
behavior leans on the well known fact that the founder is human, thus she is risk averse 
and prefers to replace the risky cash flow of her firm by the certain return of an equity 
sell. Now, if the entrepreneur keeps most of her equity, it must be true that the return is 
high enough to compensate her for the risk she is bearing. Lenders therefore value more 
the few shares put for sale. 

To demonstrate this claim, we consider a population of entrepreneurs displaying con- 
stant absolute risk aversion and facing normally distributed risk. As shown in §19.2.2, 
each maximizes E[u>] - |pV[u>] where E denotes expectation and V the variance of the 
random income w. Each entrepreneur owns a technology of two possible types, good (g) 
or bad (b), characterized by a random future cash flow Xj for i = g,b; the distributions 
have distinct expectations iu g = y, > \i h = (w.l.o.g.) but the same variance a 2 ; we can thus 
w.l.o.g. scale this parameter in order to eliminate the constant multiplier |p from fur- 
ther formulas. After selling (1 - a)% of its equity for an amount v t an entrepreneur of 
type i- g,b will have income wt = ax,{ + v\ with E[u>/] = a[i i + v i and \l[vbi\ = a 2 a 2 , thus her 
expected utility is Ui(a) = a^i i + v t - a 2 a 2 . 

If there were no asymmetry of information, the market would always distinguish the 
two technologies and being risk neutral, it would pay Vt = fx t . It is then obvious that 
each risk averse entrepreneur would be better off selling all her equity to the market 
in order to maximize uf, this would result in the optimal risk-sharing allocation. In 
the more realistic situation where the market cannot distinguish the two technologies, 
there are two possible equilibria called "pooling" and "separating". If the two types of 
entrepreneurs behave in a similar fashion, they leave the investors in the fog, the latter 



therefore put a single price for any equity and there is pooling of the types. If the two 
types of entrepreneurs behave in a very distinct manner, they might succeed to convince 
investors that one behavior is typical of a good firm and the other of the bad firm; there 
is separation of the types. 

In the first case, the single price v is an average between the value of a good and bad 
firm, thus a good firm is underpriced (and a bad one overpriced). In the second case, the 
market is able to price each type of equity at its real value; if we denote a, the share 
kept by an /-entrepreneur, we have v\ = //, for i = g, b. Compared to pooling, separation 
is desired by a good firm but feared by a bad one so that the former will try to force it 
and the latter will try to avoid it. Wise people say that "talk is cheap" which means that 
the only credible behavior available to an entrepreneur to signal the quality of her firm 
passes through the decision to sell more or less of her equity. The incentive condition 
to avoid that a bad firm pretends to be a good one is that her utility when lying (i.e., 
keeping a g as if she were a good firm) is lesser than when telling the truth (i.e., keeping 
a b ): 

ut,{a g ) <ut,{ab) o {I - a g ) fj, < a 2 {a 2 g - a 2 b ) (23.5) 

If the bad firm cannot micmick the good one then it is better off selling all of her equity 
to eliminate risk, hence we need only consider (23.5) with at = so that the condition 
becomes after some algebraic manipulations 



-^+vV(M + 4ct 2 ) , QQ R . 

a e >a = ^ (23.6) 

s - 2a 2 

Conversely a good firm should not be tempted to mimic a bad one by selling all of its 

equity i.e., 

/ a 
u g {0) <u g {a g ) o a g <a = \— (23.7) 

Algebraic manipulations 13@ enable to show that a< a, hence both conditions are com- 
patible for a £ [a; a] . Now given that risk-sharing is the motive for selling equity in the 
first place, a good firm will sell the maximum and retain no more than a% of its equity 
to signal credibly its quality to market investors (this also proves that only bad firms are 
tempted to mimic good ones). 

Given our convention, the parameter [i measures the extent of asymmetry of infor- 
mation and it is not difficult to check that a is increasing with n; hence, the stronger 
the information asymmetry, the greater the risk the entrepreneur has to bear (by re- 
taining more shares). When applied to the real world, our findings predict that in fast 
growing industries (e.g., IT services) which display large asymmetries of information, 
manager/founders of firms retain more equity than managers of firms in mature sectors 



(e.g., traditional industry) where there is little asymmetry of information. 

23.2 Agency Cost of Debt Finance 

We now switch instrument and concentrate on debt. If capital is raised from debt with 
face value d to be repaid at the interest rate r, the entrepreneur's profit is R{d) - (1 + r)d 
and the FOC of maximization is R m = 1 + r. Assuming a perfect financial market (no 
transaction cost) the price of money is the same for borrowing and lending i.e., r = 
given our previous convention regarding the PV of R. In that case, the incentives to 
invest are adequate since the optimal debt choice is d = k* , the efficient investment. 

In the absence of uncertainty (debt is riskless), lump sum finance such as debt is ef- 
ficient in the sense that the incentives of the entrepreneur to invest are not distorted 
(with respect to the case where she does not need external capital). This section will 
precisely introduce realistic features such as uncertainty or premium for borrowers to 
test the robustness of this efficiency result. Asset substitution generates excessive in- 
vestments on the part of the entrepreneur while debt overhang works in the opposite 
direction ; we also present a model blending the two effects to understand better under 
which conditions one is more likely to take place. Next, we look at credit rationing, an 
adverse selection phenomenon induced by debt finance. 

23.2.1 Asset Substitution and Free Cash Flow 

Intuition 

Jensen and Meckling (1976) is again the seminal article that first called the attention on 
the asset substitution induced by debt (aka. the over-investment effect of debt). In the 
presence of limited liability and future uncertainty, the entrepreneur is indifferent with 
respect to the size of bankruptcy and only cares for the happy times where she can pay 
her debt and keep all remaining profits. This leads her to undertake too risky and even 
unprofitable investments. 

To get the idea suppose that as a firm's manager you borrow 60 to finance a new 
project. You can either invest in a very secure idea yielding 100 and make a sure profit 
of 40, or, take a gamble with a risky project that either yield 180 or 0. Under the risky 
alternative you either win 120 or since you are protected by limited liability, hence 
on average you make 60 which turns you into a risk lover. Notice that you selected 
the inefficient project since its average return is only 90 < 100. This phenomenon is 
not limited to finance. In sports like football where matches have fixed duration, it is 
frequent to see the lagging team take increasingly more risk in a desperate attempt to 



catch up, the result being either an unlikely victory or a more presumable overwhelming 
defeat. The rationale is quite obvious whenever the ultimate goal is to achieve victory: 14 ® 
given the current score and the current field strategy defeat is almost certain, thus it 
cannot worsen to change the strategy into a more aggressive one that will increase the 
probability of winning although it also increase the probability of losing big (but who 
cares since the team would have been eliminated anyway). 15 ® 

Model 

To see the asset substitution formally, we modify the previously used DRS technology so 
as to account for market price uncertainty; the profit is n = pR{k) - k where the market 
price p is random with a distribution function H and mean E[p] = 1 (so as to maintain our 
initial convention). The expected NPV is 

/•+OO 

n* (k) = \ [pR{k) - k) dH(p) = E[p]R(k) -k = R(k) - k 
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so that the optimal investment is again the efficient level A;* solving R m {k) = 1. 

We consider a risk-neutral entrepreneur who has neither personal wealth nor is able 
to emit new equity to finance her project; she is thus forced to finance it entirely with 
debt. Following our convention, the risk-free interest rate is r = 0. We shall show that 
the uncertainty over the market price generates an agency cost in the sense that the en- 
trepreneur's optimal decision is now to over-invest (with respect to the first best decision 
k* that a wealthy entrepreneur would pick). 

As we explained in the previous examples, whenever the realized cash-flow pR{k) 
does not cover the debt obligation k, the entrepreneur defaults and walks out with a 
zero profit but no penalty since she is protected by limited liability. The cut-off level 
below which defaults takes place is p = ^fw. At the time where she must decide on her 
investment, she realizes that only the good times matter i.e., her expected profit is 
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n = {pR(k)-k)dH(p) = {R(k)E[p\p>p]-k){l-H(p)) (23.8) 
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We see that, with respect to a self-financed entrepreneur, she earns more because 
although ft and n* are the integral of the same surplus, the former goes over positive 
surpluses only. To ascertain the incentives towards investment, we look at the FOC of 
profit maximization: 
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= [R m {k)E[p\p>p]-i)[l-H{p)) (23.9) 

oR m {k) = -—^—>-^- = l (23.10) 

E[p\p<p] E[p] 

i.e., the debt-financed entrepreneur over-invests (recall the comparative static exercise of 
Figure 23. 1). 16@ 

This behavior generates default with a positive probability so that lenders do not 
recoup their loan on expectation; they will have to ask for a risk premium (r > 0) whose 
effect is to induce under-investment (cf §23.1.1 and next paragraph). Nevertheless, this 
(second) indirect effect remains dominated by the (first) direct one as long as the interest 
rate is determined by the participation constraint of lenders requiring they do not make 
a loss on expectation (cf. proof). Over-investment in this simple model has a multiplier 
effect on the rate of return of the project, it thus acts as a gamble (risk-taking) with 
respect to the efficient level. 

Free Cash Flow 

Jensen (1986) identifies another source of moral hazard, arguing the financial resources 
at the disposal of managers, the so-called free cash flow leads them towards over-investment 
Indeed, if managers have the power to decide on the use of cash flow, they are likely to 
waste it in projects with low return (negative NPV), into acquisitions to derive more 
power or worse, into perks (e.g., build a pool at home with the firm's money). Hence 
everything should be done to limit the quasi-rents at the disposal of managers in or- 
der to reduce their uncontrolled waste and force them to incur the monitoring of the 
capital markets when they must raise new capital to finance investments. This prob- 
lem is more important in sectors that generate large cash flows but have low growth 
prospects like regulated monopolies. According to Jensen (1986), it would be therefore 
desirable to force managers to pay large dividends to shareholders whenever they have 
free cash flow; but since promises are not credible (recall that "talk is cheap") one way 
of achieving this objective is to emit debt in exchange of a repurchasing of shares. This 
way, a former shareholder becomes a debt-holder and has the right to take the firm into 
bankruptcy court if it defaults on debt service. This swap of securities makes debt a cred- 
ible substitute for dividends and reduces the agency costs of free cash flow (inefficient 
over-investment). Empirical evidence support this view since share prices tend to re- 
spond positively to debt-equity swaps indicating that investors interpret these decisions 
as value enhancing. 



23.2.2 Debt Overhang 

Option to Invest 

A radically different point of view with respect to the role of debt in agency situations is 
taken by Myers (1977) who claims that an entrepreneur faces a debt overhang problem 
because at any point in time, investments are optional choices to be undertaken. So, 
whenever the net return of a project is lesser than the outstanding debt service, it is 
better for the entrepreneur to drop it and go bankrupt since she is protected by limited 
liability; somehow, she free rides on the investor. The overall effect is under-investment 
since some valuable projects are not implemented. 

To explain formally this phenomenon, we add a fixed cost c whose level is not yet 
known by the entrepreneur at the time where she decides on the investment; its distribu- 
tion function is H. 17@ If the entrepreneur gets financed exclusively by debt at the interest 
rate r (recall that the risk-free rate is zero under our convention) her random profit is 
ft - R(k) - c- (1 + r)k so that she is forced into bankruptcy whenever c > c = R{k) - (1 + r)k 
i.e., whenever the fixed cost realization is large. The cut-off c must be positive for other- 
wise the technology would be useless (at the current interest rate), but this also means 
that default occurs with positive probability. Hence, the expected repayment to lenders 
is strictly lesser than (1 + r)k, so that r must be positive to compensate their initial in- 
vestment of A;. The entrepreneur's expected profit being 

E[tt]=/ (R(k)-(l + r)k-c)dH(c) (23.11) 
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we have 

fljEjg] = R m (k)-(l + r) 

d k H{c) 

thus the optimal investment solves R m = 1 + r and since R m is decreasing and r > 0, it 
involves under-investment. 

To understand the difference with the previous over-investment result, notice that 
unlike a multiplicative shock like price, an additive shock like fixed cost has no effect 
on the marginal productivity, thus does not distort incentives. However, both generate 
default which is risk from the point of view of lenders; this means that they demand a 
positive risk premium whose (indirect) effect is to dampen incentives to invest. 

Empirically, firms in mature sectors (e.g., heavy industries) have free cash flow and 
little investment opportunities so that they tend to invest in negative NPV projects. The 
reverse holds for firms in high growth sectors (e.g., IT services) who lack funds but not 
ideas and cannot implement as many as efficiency would command. To conclude, over- 
investment should be more severe in mature industries while under-investment should 



be more severe in high growth industries. 

A rejoinder 

Loosely speaking our previous models conclude for over-investment occurs if the risk is 
multiplicative (e.g., market price uncertainty) and under-investment if the risk is addi- 
tive (e.g., fixed cost uncertainty). The following adaptation 18@ draws a clearer frontier 
among the two effects by blending together the original arguments of Myers (1977) and 
Jensen and Meckling (1976): over-investment is likely to happen if the firm's realized 
cash-flow is greater than expected (free cash flow) while under-investment takes place 
in the reverse situation (debt overhang). 

Assume that the entrepreneur uses long-term debt to finance a project developing in 
two stages, with a promise to repay d at the end. Ex-ante, she raises funds to run her 
normal business and to perform R&D with a view to develop a better technology. At 
the interim period, the current assets yield a cash-flow x, a part of which A; can be used 
to invest into the new technology R which is now available; the efficient investment k* 
solving R m {k) = 1 can be undertaken at the interim period. 

What creates a moral hazard problem at the interim stage (when A; is chosen) is the 
market price uncertainty that plagues every period: from the ex-ante point of view, the 
interim cash flow x is random, hence there is no way to tune the ex-ante investment so 
as to obtain exactly x = k* which is necessary in order to re-invest optimally into the new 
technology. Now, at the interim period, the entrepreneur disposes of a realized cash flow 
x and must decide how much to invest in her new technology knowing that the ex-post 
market price p will also be random (with unit mean). 

If the realized interim cash flow is x < k* , then the entrepreneur is forced to put up 
money to invest at the desired level. 19@ Since she faces the debt overhang problem, she 
won't put the missing k* -x but less, so there is under-investment. If, on the contrary, the 
interim cash flow is a generous x > k* , then the entrepreneur has free cash- flow and takes 
advantage of the existence of debt to gamble over the uncertainty of the final return; she 
undertakes excessive investment. 

To derive precisely these results let us study the optimal investment of the entrepreneur 
in two polar cases. By investing k < x in the high cash-flow scenario, her final wealth is 
Vh = max{0, pR{k)-d + x- k}. By investing k> x (putting up k- x out of her pocket) in the 
low cash- flow scenario, her final wealth is vi = max{0, pR{k) -d} + x- k. 

Let ph and pi be the solutions of pR{k) + x-k = d and pR{k) = d. The respective interim 



expected wealth are thus 



E[v h ]= P° [pR(k) -d + x-k) dH(p) if k < x 
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E[vi] = x-k + f™{pR{k)-d)dH{p) ifk>x 



The optimal investments in each case are kh and ki solving 
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Basic probability tell us that 
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/ pdH(p) < f pdH(p) = 1 = Up\ < E[p\p > p h ] 
Pl o 

so that since R m is decreasing, we have the ranking ki < k* < kh- 

A bit of logic is required to conclude. If the cash flow realization is large (x > kh) then 
the optimum investment is kh > k* , there is thus over- investment (yet never by more 
than kh - k*). Conversely if the cash flow is low (x < k{) then the optimum investment is 
ki< k* , there is under-investment (yet never by more than k* -k{). Finally when the cash 
flow is intermediate, it is entirely invested. 20@ There is thus either a moderate under or 
over investment depending on where x falls with respect to A;* . 

23.2.3 Debt as the Optimal Security 

Intuition 

We tackle here the problem of optimal security design i.e., finding the most efficient con- 
tract an entrepreneur and an investor might sign. Recall for instance that equity leads 
to under-investment by the entrepreneur either in physical or human capital because 
she is forced to share part of the future profits with the investor. Under the reason- 
able assumption of limited liability for the entrepreneur, Innes (1990) shows that debt is 
optimal to promote effort in situations of potential moral hazard. 

The result builds on two simple observations. When the cash-flow is low, the investor 
gets to keep all of it so that the entrepreneur's motivation towards effort is minimal. 
On the contrary, when the cash-flow is large enough to cover the debt obligation, the 
entrepreneur is the residual claimant of any increase in cash flow, thus is optimally mo- 
tivated towards effort. Obviously, efficiency corresponds to zero debt so as to make sure 
the entrepreneur is always the residual claimant. The need for external finance will nee- 



essarily introduce a distortion in the sense that for some values of the cash-flow realiza- 
tion, the entrepreneur will receive only a fraction of this cash-flow; as a consequence, she 
will have incentives to under-invest, thereby generating an inefficient outcome, deemed 
a "second best". 

Our original question boils down to decide where to put these distortions. It turns 
out that debt is optimal because it concentrates the distortion on low levels of cash-flow 
which represent small prizes, thus small disincentives while it makes the entrepreneur 
residual claimant for large cash-flow which represent large prizes giving large incentives 
toward effort. On Figure 23.3, we show four reimbursement rules: j d is the debt rule 
corresponding to loan d (bold curve), y a is the equity rule corresponding to the sale of a 
share a of future profits, 7 is the strange rule where there is no repayment until cash-flow 
reaches a minimum, then the repayment increases faster than cash-flow until it achieves 
a maximum; lastly 7 is a weird repayment rule that does not make any economic sense 
but which is nevertheless imaginable. Clearly, the debt rule is the closest to the diagonal 
for low cash-flows and then the farthest away for large cash-flows; this means that when 
compared to another rule, the debt rule is firstly above then permanently below. 
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Figure 23.3: Repayment Rules 



Model 

To prove formally this result, consider a penniless entrepreneur investing k into a project. 
The future cash-flow is a random variable x such that H{k,x) = Pr{x <x\k), the probabil- 
ity to observe a result lesser than x depends on the investment A;. For any function /,we 
denote E[/(jc) I k] = ff[x)dH[k,x). 



An investor pledges A; in exchange for a repayment rule 7 function of the future cash- 
flow. Popular rules shown on Figure 23.3 are debt with y d {x) = min{x,d} or equity with 
y a {x) = ax. The restrictions for an admissible rule are y(x) < x because the entrepreneur 
is protected by limited liability (she has no collateral to pledge to her creditor) and 7 > 0. 
To understand this later property imagine that 7 is decreasing at some level of cash-flow 
x; the creditor could then artificially reduce the available cash flow down to x by calling a 
costly external audit and get a contractual repayment y(je') > y{x). The actual repayment 
rule would, at most, be flat but never decreasing. The investor's expected repayment is 
E [y(jc) I k] so that the entrepreneur's expected profit is n[y, k) = t[x- y(x) | k] - k. 

Assume that the agreed rule 7 is optimal and is not a debt rule. The entrepreneur elic- 
its the investment k maximizing 71(7, A;), i.e., solving R m [k) = 1 + — ^ \ Since E [y d \ k] is 
increasing with leverage d, there exists a unique debt level d such that E [7 1 k] = E [y d | k] . 
By construction of a debt rule, the difference f{x) = y{x) -y d {x) is negative for small x, 
zero for some y value and then becomes positive for x > y (check on the right panel on 
Figure 23.3); it displays the single crossing property (SCP). 

We now assume that a higher effort induces a change in cash-flow distribution sat- 
isfying (MLRP) 21@ so that we can conclude (after Milgrom (1981)) that the expectation 
of the random variable fix) is increasing with the investment A;. In other words, its 
derivative with respect to A; is positive which given the definition of / reads 

aE[ T |fc] ^ dE[y d \k] 

~dk~ > dk {2S - 1S) 

Property (23.13) of the cash-flow distribution means that a small additional invest- 
ment increases the expected debt repayment less than under any other rule. Thus, this 
small additional investment increases the entrepreneur's expected profit faster under 
debt finance than under any other rule i.e., 

dn(yj,k) dn{y,k) 

{ -±^-L > \^-L (23.14) 
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By optimality of A; under the rule 7, the RHS of (23.14) is nil, meaning that under 
the new debt rule y d , the entrepreneur can increase her expected profit by choosing an 
investment k > k which is welcomed by the investor since E [y d | A;] > E [y d \k] = E [7 1 A;] . 22@ 
This fact ends the proof that the original rule 7 was not optimal. 

Risk Aversion 

If we wish to take into account that the entrepreneur is most often risk-averse (relative 
to the investor), Matthews (2001) shows that debt optimality continues to hold if the 



original contract can be renegotiated by the entrepreneur after she invested but before 
the cash flow is realized. Consider a non debt candidate optimum y that is renegotiated 
towards a contract . The objective of the entrepreneur at that point is to eliminate risk 
which would require 0(x) = x-cte but this would violate the investor limited liability, 
thus the entrepreneur would choose something akin to the inverse of debt with 0(x) = 
max{0, x-cte} tuning the constant so as to generate acceptation by the investor. The 
problem then is that this new final contract generates poor incentives to invest (it's the 
opposite of debt!) and therefore cannot raise a lot of money from the investor. A mid-point 
will have to be struck to preserve investment incentives with respect to risk sharing. 

Since the entrepreneur final payoff is n{(p , k) = E u{x - (p r (x) - k) \ k for some concave 
function u, the investor expects the investment A;* to maximize n{(p ,k); he will thus 
accepts r only if E y | A;* > E [y\ k*]. If now the initial contract is changed for a debt one 
y d such that E [y d | k* ] - E [7 1 k*\ then the entrepreneur can still invest A;* and offer in 
renegotiation which proves she can't lose from the change towards debt. We still need 
to show that the investor won't fear a change of investment that is bad for him, thus 
making the initial offer uninteresting. 

Let cp d and k d be optimal after y d i.e., k d maximize n{(p d ,k) and E[0 d |fcd] > E[y d | k d \. 
If, on the one hand, n{(p d ,k d ) = n{(p r ,k*) then (0 ,k*) is optimal after y d i.e., the change 
to debt does not change the final contract nor investment, thus the investor does not 
lose. If, on the other hand, the change to debt is profitable for the entrepreneur then she 
invests more 23@ which is unilaterally good for the investor. 

23.2.4 Credit rationing 

Credit rationing as well as unemployment are important preoccupations for macroe- 
conomists but also for micro-economists since these long lasting disequilibrium phenom- 
ena fail to be explain by the classical general equilibrium theory. Inspired by Akerlof 
(1970)'s lemons model of the used car market (cf. §21.1.1), Stiglitz and Weiss (1981) 
show how asymmetries of information among borrowers and lenders can create a market 
imperfection that limits the volume of credit and endogenously generate credit rationing. 
In a credit market, like any other market, demand is decreasing in the nominal in- 
terest rate while supply is increasing with the effective interest rate (return on unit 
loan). In a perfectly competitive market where asymmetries of information are absent, 
the two kinds of rate are identical. The novelty is to demonstrate how uncertainty about 
the quality of lenders can make a difference between the nominal and realized interest 
rates. More precisely, it might be the case that the realized interest rate reaches a max- 
imum such that the corresponding (larger) nominal rate generates excess demand for 



credit. There is credit rationing because no lender agrees to loan more as he knows this 
would only lower his effective return. 

The story behind this phenomenon builds on the well known positive correlation be- 
tween leverage and default. Understanding this relationship, lenders tend to believe 
that firms asking much credit are signaling a high probability of default. This in turn 
leads them to ask for a large risk premium. The problem with this attitude is that 
it may affect the average quality of applicants (adverse selection) and their behavior 
once financed (moral hazard). As we shall demonstrate afterwards, raising the nomi- 
nal rate does not reduce the demand for credit in an even manner because the safest 
entrepreneurs drop out so that the pool of remaining applicants are of lower intrinsic 
quality and worse still, they take more risk than ever. These negative effects diminish 
the effective interest rate, so much that it might be the case that they outweigh the 
original nominal increase. 

The adverse selection effect is quite similar to the asset substitution effect: riskier 
projects are more profitable on expectation whenever the entrepreneur uses debt and 
is protected by limited liability. To see this formally, consider two projects looking for 
the same funding k with random cash-flows x and y, that have the same expected value 
i.e., E[y] = E[x]. Since the repayment of debt R{x) = min{(l + r)k,x} is linear then constant, 
it is concave, thus the entrepreneur's profit, n{x) = x-R{x), is convex. Rothschild and 
Stiglitz (1970) introduce a statistical notion of riskiness such that if y is more risky than 
x, then E[7r(j/)] > E[ti{x)]. 24@ When a lender increases the nominal rate r, firm profits are 
lowered, hence also fall on expectation so that the less risky project x drops out i.e., cease 
to demand credit. This means that the average riskiness of applicants increases (only 
y remains) and therefore the average expected repayment drops (at constant interest 
rate). 25@ 

If there are only two classes of risk, safe and risky people, there is a cut-off nomi- 
nal rate, say f where safe people drop out; at that point the average repayment falls 
much more than the gain generated by the nominal rate increase. In other words, the 
realized interest rate reaches a maximum at f . The optimum for lenders is therefore to 
offer the highest nominal rate that guarantee participation from all entrepreneurs i.e., 
r. This also means that the supply of funds has reached a maximum because supply is 
an increasing function of the effective rate. Yet this optimal rate proposed by lenders 
generates a demand for funds larger than the supply so that credit rationing occurs. 

Regarding moral hazard, notice that after a nominal rate increase, the structure of 
profits becomes more convex, 26@ thus motivate entrepreneurs to gamble and choose more 
risky projects thereby worsening the expected repayment to lenders and the realized 
return on their loans. Indeed, we have E[7r(x)] = fa+r)d^ x ~ ^ + r)d)dHi(x) where Hi is the 



distribution function of the random variable x. By the definition of the cut-off where the 

entrepreneur defaults, 

dE[n(x)] 

— = - (1 - Hi ((1 + r)d))d<0 

or 

hence 

dE[n(x)] dEMy)] 

— - < — o Hi((l + r)d) < H 2 ((l + r)d) 

or or 

where H 2 is the distribution function of the random variable y. If, at the initial interest 
rate, the firm was indifferent between two projects x and y (n{x) = n{y)) then after the 
increase of r, the project with a higher probability of default y is preferred to the safer 
one (profit was reduced for both but less for y). 

This Stiglitz and Weiss (1981) result is however highly sensitive to the nature of the 
uncertainty. If the screening process of lenders identifies returns but leaves doubts re- 
garding the probability of success, de Meza and Webb (1987), show that too many projects 
are undertaken. 27 ® The reason is quite simple: when the return in case of success R is 
known the entrepreneur's profit is increasing with p, thus the projects that ask for fund- 
ing are those above the participation threshold, contrary to what happens in the Stiglitz 
and Weiss (1981) case. The reverse equilibrium sub-optimality then follows. 

23.3 Managerial Incentives 

Success in business is rarely immediate which means that an entrepreneur needs time 
to build experience and try different options before she can hope to hit the jackpot, so to 
say. We saw previously that equity finance had a demotivating effect but debt financing 
is not perfect either. Indeed, since cash-flow is likely to be low during the first years 
there is a serious possibility of defaulting on the debt obligations. Now, going bankrupt 
is a very dark prospect for the entrepreneur. Not only does she loses the prestige of her 
position but above all she loses all the human effort she invested in the firm; in other 
words, the human capital she amassed is complementary to the physical assets of the 
firm. 

A similar albeit weaker argument holds for the manager of a dispersedly owned 
firm: failing to maximize profits in the presence of debt increases the probability of 
bankruptcy, thus the probability of losing one's job and the perquisites associated with 

it 28@ 

The first two works presented below use this observation to show that debt can allevi- 
ate agency problems of adverse selection and moral hazard. In the third part we present 
the efficiency wage theory explaining how asymmetries of information force firm to pay 
higher than necessary wages (and also generate unemployment). 



23.3.1 Debt as a Signal of Profitability 

Ross (1977) in an early application of Spence (1973)'s signaling theory, shows how the 
manager of a firm can use debt to signal the profitability of her firm (cf. §21.1.3). 

In the absence of uncertainty and under complete information regarding profitability 
the value of a firm is correctly assessed by the market and the financial structure (debt 
or equity) does not matter. 29@ If now the profitability or future cash-flow is a private 
information of managers, the market will price each firm at an average which means 
that a profitable firm will be underpriced. Most attempts to signal a high profitability to 
the market will be mimicked by lower quality firms. In that situation, debt can help a 
manager. To see this, imagine there are only two firms, one "good" and one "bad" with 
certain future cash flows x and y < x. The "good" manager needs to emit debt d e]y;.x] to 
avoid bankruptcy but make sure that an imitator would surely go bust. Then it must be 
the case that bankruptcy is a concern for her to tell the market that her choices reveal 
her desire to avoid bankruptcy. As we previously argued, this is naturally verified for 
an entrepreneur. As for a manager, she must tie her own remuneration in a significant 
way to the final cash flow of her firm to convince the market that she cares to avoid 
bankruptcy. Her remuneration contract might incorporate a share of the bankruptcy 
costs. 

The formal model uses a continuum of types r uniformly distributed over [a;b]. The 
future cash-flow of a type t firm is a random variable x T uniformly distributed in [0;2t]. 
Given a debt level d, the terminal value of a r-firm is x T if she successfully repays her 
debt (case x T > d) and x T - L otherwise, L being the cost of reorganization generated by 
defaulting on the debt obligation. The expected final value is thus computed as 
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assuming d < 2t (debt lesser than maximum cash-flow). 

Let us consider the remuneration w = y V + j l V\. In equilibrium the choice d T of a 
type-r firm must be optimal for the manager, hence |j = o Yo v o + Ti v [ = ° ° V(,{d r ) = J^. 
Since in equilibrium types are revealed we have V {d T ) = x hence V^(d T )d' T = 1. Plugging 
into the previous equation we obtain d' T = -p£, thus d T = yl- + c where c is the integration 
constant. Taking into account the fact that the worst type will not emit any debt to 
eliminate the risk of bankruptcy, we derive d T = - L — . Lastly we must check that 
our initial assumption d < 2t is satisfied which requires eliciting y 1 large in front of y 
i.e., the manager's remuneration must strongly depend on the future where bankruptcy 
might happen to credibly transmit information to the market as regard the type of her 
firm. 



23.3.2 Debt as a Signal of Obedience 

A moral hazard issue tantalizing financiers is the possibility that the manager of a firm 
follows the pursuit of happiness rather than the pursuit of benefits. As we explained 
in the introduction, it is reasonable to assume that bankruptcy is costly for a manager- 
entrepreneur; this fact leads Grossman and Hart (1982) to argue that issuing debt is a 
pre-commitment or bonding behavior aimed at convincing investors that the firm will be 
managed to maximize profits, so as to avoid bankruptcy. Hence, a high leverage could 
signal the good prospects of the firm and be the guarantee that investments will be 
carried on at the efficient level. 

To check formally this claim, we consider first the manager of a firm that is a distinct 
from the owners. The firm (aka. the owner) raises an amount F of funds by selling 
a mix of equity and a debt obligation d. The moral hazard issue here is the fact that 
it is the manager who allocates k into physical capital and the remnant F-k into her 
human capital which we interpret as a private benefit since it is an unalienable asset 
that cannot be taken away by the owners. 30 ® What disciplines the manager is the fact 
that she will enjoy her private benefits only if she remains at the head of the firm. 

The project's future cash flow is R{k) + x where x is a random shock of zero mean 31 ® so 
that the expected NPV of profits is R{k) - k. Given the uncertainty regarding the future 
cash flow, the firm will go bankrupt whenever R{k) + x < d, hence the expected utility of 
the manager is the average of her perks F-k over the states of nature where she enjoys 
them i.e., 

E [F- k\R(k) + x > d] - (F- k) (1 - H(d -R(k))) 

The optimal investment k solves the FOC 

{F-k)R m {k)h{d-R{k)) = l-H{d-R{k))<>———— = (p{d-R{k)) 

{F-k)R m {k) 

where <p{x) = jz^h) is the hazard rate of the distribution function H. 32@ In equilibrium 
of the capital market, investors anticipate the choice k, thus value the firm's securities 
F as the present value R{k) of the project; the first order condition determining k then 
becomes {R{k] _l ]R ik] = (f>{d-R{k)) and we observe on Figure 23.4 that an increase in debt 
from d\ to d 2 moves the RHS up, hence the optimal investment increases from ki to k 2 ; 
we can therefore conclude that the leverage chosen by the entrepreneur is a commitment 
to invest mostly into the firm's future value and not into her personal satisfaction. 

Adding a greater degree of realism is possible without changing the qualitative na- 
ture of the result. Firstly, the utility for the manager need not be linear in the perks 
she keeps for herself, it can be u{F - k) for a concave increasing utility function. Then 
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Figure 23.4: Bonding Behavior of Debt 



the FOC becomes ,„"' ( £ n *°„., = (p{d-R{k)) but since the ratio 



u'jx) 



u(F- k] R m (k) -w» «w„ — — ^ -„» u[x] is a decreasing func- 
tion of x, our previous conclusions remain unchanged. Then we are able to consider the 
case of an entrepreneur instead of a manager; in that case the utility she maximizes is 
E[u(F-k) + R(k) + x-d\R(k) + x>d] so that the FOC becomes ^gl^^g' =<f>{d -R(k)) and 
once again, the qualitative effect of raising the leverage remains a commitment to later 
choose a greater investment. 



23.3.3 Efficiency Wage 

Intuition 

The persistence of unemployment in competitive economies has long been a puzzle that 
standard neoclassical theory could not solve. Indeed, in the many economies around the 
world that do not guarantee a minimum wage, why can't the market wage descend low 
enough (with subsistence as a lower bound) to stimulate demand and thereby provide 
a job to every person who seeks one? Shapiro and Stiglitz (1984) offer an innovative 
explanation based on incentives and asymmetry of information. 

To understand their theory it is useful to work by contradiction. If there existed an 
equilibrium with full employment, a fired worker would be instantaneously rehired at 
the same equilibrium market wage. Thus, in terms of utility, she would barely notice the 
change which means, adopting the firm's point of view, that there is no way to penalize 
a worker caught shirking (not working as hard as stipulated in her contract). Thus, 
to motivate hard work, firms are forced to pay above-market wages in order that the 
loss of one's job be painful. But, as always in economy, if one firm finds it attractive to 
pay above-market wages, then all firms will do the same. This means that the market 
wage increases and exactly matches the wage paid by each firm, hence work incentives 



have been destroyed once again. However, wages being above their natural equilibrium 
values, labor has become a more expensive input (lower productivity) and its aggregate 
demand falls generating unemployment. Now, the consequence of being fired (for being 
caught shirking) is more dreadful than before because the individual will have to wait 
before finding a new job and will have to live poorly in the meantime. 

We have thus seen that the wage is not only the price of the labor input equating 
demand and supply but also a device to provide work incentives inside the firm. As in- 
tuition suggests, when one instrument is used to solve two problems, some inefficiencies 
are bound to appear. 

Model 

To check this claim, consider identical risk neutral workers whose utility is, w - q where 
w is the wage and q the effort they exert (0 if jobless). The common rate of discount 
is r. For simplicity, the efficient effort is a fixed level q > 0. At each period, a person 
can be unemployed (type u) or enjoy a job (type ;') in which case he will either work 
obediently (o) or shirk (s). Jobless people receive a benefit b that can be provided by the 
State or family. There is an exogenous probability p of losing one's job (e.g., the firm goes 
bankrupt) and a probability A of being caught by the monitoring technology if shirking. 
The value of being employed and unemployed are denoted Vj and V u ; the present 
value of remaining in a given situation forever is V x £jt>i 1 k = rV. For an employed 
"shirker" with present value V s we also have 

Vj=w-{p + X){Vj-V u ) (23.15) 

i.e., what she expects is equal to her current wage minus the expected loss of utility 
relative to the possible job termination that might occur at the end of the period (if caught 
shirking). For an "obedient" worker, the effort must be accounted while the probability 
of losing one's job is lower, thus the equation is 

rV?=w-q-p{V?-V u ) (23.16) 

Solving the two equations so obtained we derive 

w + {p + X)V u , _ w-q + pV u ,^ „„. 

V = - — and V° = " F (23.17) 

1 p+A+r J p+r 

Shirking won't take place only if V° > V s o w > w = rV u + \ ■ W e can now assume 
that employers will pay exactly w to avoid shirking by their employees. 



For a jobless person, there is a unique equation since he does not have to decide if 
he shall work or shirk. Letting a denote the job acquisition rate, the value of being 
unemployed satisfies 

rV u = b + a{Vj-V u ) (23.18) 

Since in equilibrium, workers do not shirk we have Vj = V?; (23.17) and (23.18) form 
a system whose solution is 



Vi 



(u>- q){a + r) + bp 
a + p + r 



and V u 



(w- q)a + b{p + r) 
a + p + r 



(23.19) 



Replacing the latter formula into the definition of w further yields w - b + q + g+p A +r - . 
In terms of comparative statics the critical wage must be larger when work is more 
painful (larger q), when the unemployment benefit b is larger, when the probability of 
being caught shirking A is lower, when the interest rate r is larger (preference for the 
present) and when the economy is more unstable (larger p). 

Lastly, using the total worker population N and the employed population L, we can 
derive a since in equilibrium, the flows out of unemployment equal the flows in so that 
a{N- L) = pL^> a - j^j- and w = b+ q + f (]fzx + r • As we can see on Figure 23.5, this 
relation draws a frontier L = S{w) between combinations of employment and wages that 
induce shirking or work inside firms. We can interpret this curve as a labor supply 
function. 
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Figure 23.5: Efficiency Wage 
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To derive the equilibrium on the labour market we have to consider the behavior of 
firms; we only need to assume that the marginal productivity of labour is decreasing 



with employment to obtain a downward slopping demand curve D. Also the productivity 
of the fixed effort q must be large enough to warrant full employment at the complete 
information equilibrium i.e., D{b + q) > N as represented on Figure 23.5. Given the moral 
hazard issue of shirking, firms are forced to add the incentive constraint L < S{w) to 
the traditional technological constraint L < D{w) i.e., they limit their demand schedule 
to the upper part of the D curve so that the equilibrium is a pair {w*,L*) generating 
unemployment. 

Some interesting observations are: 

• All the above comparative statics regarding conditions that increase the critical 
wage turn out to increase unemployment. 

• Jobless workers would accept a lower wage but since they are unable to commit not 
to shirk, firms prefer to let them out. 

• During a recessions the labor demand moves down which lowers wages but the 
probability of shirking rises (larger p) so that unemployment is doubtlessly in- 
creased. 

• The equilibrium is inefficient because firms employ too few (they equate w* not 
e to their productivity) and because each of them causes a negative externality 
on others given that V u is increased for other firms when one of them hires more 
people. 

If the workers are the communist owners of firms then the optimal level of employ- 
ment maximizes workers utility {w - q)L + b{N - L) under the "no shirking" constraint 
w > w and the industry participation constraint wL < O(L). Given that w> w^> w> b + q, 
the social objective turns out to be the maximization of employment under the two pre- 
vious constraints i.e., the optimum solves w{L) = <${L)/L. For most industries, there are 
diseconomies of scale so that the average productivity is greater than the marginal one 
and leads to a greater employment than at the market equilibrium. If workers and own- 
ers are different economic agents then the only change in the above Pareto program is 
wL < O(L) +6 where > is a non negative parameter. As a result, optimal employment is 
reduced but it still remains above the market equilibrium level. Hence we may conclude 
that asymmetric information and costly monitoring generate a market failure creating 
too much unemployment. The government is thus warranted to intervene to reduce un- 
employment by using (not too distorting) taxes. 
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Chapter 24 

Standards and Components 



Listening to music, taking photos, watching a video, using a computer are occupations 
involving bundled goods because each of these activities uses a media for storage, an in- 
terpreter for treatment and a human-interface to enjoy. There are many other everyday 
life examples of components that are assembled to make up a final good e.g., a house, an 
airplane, a meal, a car or an electronic device. 

When two components can be successfully combined they are said to be compatible 
or to follow the same standard. More generally, a standard is a convention, a specifica- 
tion, a protocol, or an understanding that allows successful interaction between humans, 
between machines or between humans and machines. Agreement on a standard 1 @ is de- 
sirable for society as it enables: 

• interchangeability in consumption (CD, DVD, tape, disk) 

• interchangeability in production (car parts, electronic components) 

• ease of communication (telephone, keyboard, units, english language) 

Most final goods are sold as pre-assembled bundles because it is much cheaper to 
have the bundling realized by an industrial producer than by the end-user. 2@ However, 
the consumer's desire to mix and match himself the components has lead to increased 
connectivity, a concept we could define as compatibility made user-friendly For instance, 
loudspeakers can be connected to any hi-fi system or the SIM card sold by a mobile 
network operator can be inserted in any mobile handset. More connectivity means more 
markets and therefore more competition. 

Bundling is also present in services such as energy or telecommunications. Indeed, 
these are made of several components displaying complementarity as well as substi- 
tutability for the consumer. Due to network externalities (cf §25), their supply displays 
economies of scope and scale so that firms tend to be active in the market for the final 
service rather than the market for a single component. 3 ® 

In this chapter, we first look at the market power that a firm can derive from the 
complementarity of the various goods or services it sells. Next, we study the incentive 



for firms to agree or disagree on a standard i.e., whether to make their components con- 
nectable for the end-user. We also look at the pattern of adoption for a standard or a 
new technology. In the following section, we look at the critical mass issue and dis- 
cuss some myths related to over-stretched conclusions derived from some simple models 
of network externalities. Detailed examples explain why history and dynamics should 
never be lost of sight when dealing with standards. Our penultimate section analyzes 
two sided platforms which are intermediary networks trying to connect end-users and 
service providers. Finally, we conclude this chapter with social interaction, the effects 
of conformity and vanity on our everyday purchasing behavior; we show that the tradi- 
tional conclusions of demand theory can be altered by network phenomena. 

24.1 Components: Tie and Bind 

Before considering competition among firms on standard setting, we investigate how 
one firm with market power, for instance a monopoly, can use tie and/or bind as a price 
discrimination device to increase profits (cf also §4.1.3). Needless to say, bundling may 
also be triggered by technical complementarity, cost savings and network effects. An 
early reference discussing this strategy is Burstein (1960). 

24.1.1 Tying 

Tying (aka tied selling or pure bundling) is the commercial practice that conditions the 
sale of product B on the purchase of another product A. A frequent example is the in- 
stalled operative system (OS) on a computer; requiring an OS free hard-drive does not 
entitle the buyer to a rebate which means that hardware and software are tied. The 
practice although common has nevertheless been judged anti-competitive in the EU and 
the US in a number of cases among which: 

Hilti a producer of fastening systems used in the building industry, was abusing its 
dominant position by supplying cartridge strips only when purchased with the nec- 
essary complement of nails. 

Tetra Pak required its customers to use only the Tetra Pak's cartons with the Tetra 
Pak's filing machines and moreover, to obtain those cartons only from the supplier 
itself. 

IBM required the purchasers of its mainframe computers in the 1970s to buy exclusively 
its tabulating cards. 

Kodak tied the sales of part for machines to the sales of repair services for these ma- 
chines. 



Microsoft tied the "Internet Explorer" browser to the "Windows" operating system. 

In all the previous examples, a firm sells a unique durable good A (protected by 
patents) so that the firm holds market power. However, this durable good consumes 
a perishable good or service that can be delivered competitively. In the absence of tying, 
the monopoly can freely fix p A but is obliged by the competitive fringe offering good B 
to set pb = c, the marginal cost of delivering good B. This situation is quite similar to 
the monopolist using a two-part tariff; when consumers are homogeneous, the unit price 
is optimally set at the marginal cost to generate maximum demand and thus maximum 
surplus which is then siphoned through the subscription. The latter plays the role of pa 
while the unit price plays the role of p#. 

Now, in the more realistic case where consumers are heterogeneous, we already saw 
in §4.3 on quantity discrimination that the optimal pricing scheme involves increasing 
the unit price and decreasing the subscription so as to rip higher margin on all the 
consumers eager to consume good B, the consumable, in large quantities. Translated to 
the present situation, the firm would like to bind the purchase of the two items in order 
to be able to set Pb> c which is the absence of tying is impossible due to the competition 
of consumable makers. 

24.1.2 Bundling 

A commercial practice more flexible than tying is mixed bundling whereby each good can 
be purchased either as a separate item or as part of a bundle. 

Imagine that potential consumers belong to two groups i = 1,2 of equal size whose 
willingness to pay are w\ and w l B for i = 1,2. If individual characteristics were observ- 
able and discrimination was legal, the optimal prices would be the reservations prices. In 
the more realistic case where individuals cannot be distinguished, the seller must charge 
Pa = min{w\,w A \ if he wants to include everyone and similarly for good B. Bundling is 
almost always superior because it averages the differences among consumer groups; in- 
deed the optimal bundle price is pab = min{w\+ w b ,w a + w b } > pa + Pb as soon as there 
is some asymmetry in the consumers preferences. Examples of famous bundles are the 
Microsoft Office suite which bundles programs for typing texts, calculating with spread- 
sheets, maintaining databases and making presentations. 

Another case where bundling can appear is when the two goods show complementar- 
ity in use, an issue already seen in §3.2.3 on multiple products. Consider for instance 
the demand for the bundle D(p) = \-p where p is the price of the bundle. If the two 
components are sold separately at prices pa and ps then the bundle price perceived by 
consumers is p = Pa + Pb so that firms A and B receive demands q A = 1 - Pa - Pb and 



qB = ^-PA~PB- The inverse demands are Pa = \~Pb~ c\a and Pb = 1~Pa- 4 b', assuming 
zero cost, the optimal prices (found from the optimal quantities) are then p A = -^ and 
Pb = —^ A - The equilibrium is thus Pa = Pb = \ leading to industry profits of §. 

If the two firms integrate, then the usual monopoly price is \ leading to the greater 
profit of \ which could rationalize the fact that firms making complementary goods tend 
to merge. Noticing that sales increases, the merger is efficiency enhancing, contrary to 
intuition. The reason for this paradox is that the independent firms fail to account for 
the externality their own price impose on the demand for the other component; when 
competing one against the other they unduly restrict trade much like in the double 
marginalization problem (cf §14.1.3). 

Adams and Yellen (1976) illustrate this reasoning with the help of Figure 24.1 by first 
abstracting from cost issues. The distribution of WTPs for the two goods are shown on 
the two axes and are assumed to be independently drawn so that a point represent a 
consumer. Treating each good separately, the firm finds two profit maximizing prices p* A 
and p* B which divide the space into four zones. This is illustrated on the left pane. People 
whose WTP pair lie in (i) buy neither A nor B, those in (ii) buy B only, those in (iii) 
buy A only while those in (iv) buy both products. When applying pure bundling or tying, 
the firm finds an optimal price p* AB and only those consumers whose WTP pair satisfies 
u>a+ wb> P* ab will buy. Hence zones (a) and (/J) are separated by a line of slope -1 as 
displayed on the central pane of Figure 24.1. Lastly, when the firm combines the two 
previous approaches, consumers are sorted again into four groups with the novelty that 
group (iv) whose members buy the bundle is made of zones (a), (/}), (y) and {6) where 
people would, respectively, buy none of the separate goods but the bundle, only good B, 
only good A and finally those who under any circumstance end up with the two goods. If 
production cost are zero then mixed bundling with pab < P* A + P* B is a superior strategy 
since the a people now buy the bundle instead of nothing. 4 ® 

In the general case where values are not independently drown, bundling is attractive 
when values are negatively correlated. If some people display a low WTP for good A and 
a high one for B, they might buy the bundle instead of solely B. When this happens, the 
firm is making a loss over the good A which would not happen under pure component 
pricing. 

More generally, McAfee et al. (1989) show that mixed bundling is profitable relative 
to separate sales for the simple reason that offering more (cleverly designed) options 
to consumers cannot hurt wrt. optimal separate prices. Indeed, if we initially set the 
bundle price to be the sum of the optimal separate prices, it must be the case that one 
can increase profit with a slight change of bundle price. We already saw that with inde- 
pendently distributed values, it is optimal to discount the bundle price so as to generate 
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Figure 24.1: The advantage of Bundling over pure pricing 

additional sales (which is welfare improving). The more problematic case is when it is op- 
timal to raise the bundle price because then the firm must monitor sales to keep buyers 
from assembling the bundle by themselves (i.e., buying the independent components). 

Mixed bundling might be unfeasible because of antitrust pressure (cf case of dearer 
bundle wrt. separate items), technological reasons or transaction cost. 5@ Hence, the firm 
must decide whether to sell two good or one bundle good. Fang and Norman (2006) show 
that under likely assumptions regarding the distribution of WTPs, bundling reduces 
variance which is good for the firm when a good should be sold with high probability 
(either because costs are low or because valuations tend to be high). Conversely, the 
reduction of taste dispersion is detrimental when the goods have only a thin market 
(either because the costs are high or because valuations tend to be low). Thus a low 
marginal cost or high mean valuation make bundling more likely to be optimal. 

24.2 Creation of a Standard 



The International Standard Organization (ISO) as its name makes clear, organizes the 
activities of many research committees among which the Moving Pictures Expert Group 
(MPEG) and the Joint Photographic Experts Group (JPEG). In other instances, the stan- 
dard is developed privately as in the cases of CD and DVD which were the result of 
lengthy negotiations among industrials (cf. history of the CD-audio standard). In many 
instances, a single private initiative succeeds to grab a dominant market share and then 
becomes a "de facto" world standard. However, as the high-definition DVD story illus- 
trates, it sometimes happen that several private initiatives are developed competitively, 
offering incompatible standards. This situation is extremely inefficient as it retards the 
development of the consumer market until one of the contender is defeated. 



We shall consider in turn the competition among two substitutable candidates for 
achieving the status of standard, then we look at the effect of complementarity among 
the two products and finally at the issue of compatibility among the two products. 

24.2.1 Substitutes 

Two firms Apple (A) and Microsoft (B) endorse a different standard (Macintosh and Win- 
dows) for the operative system of their computers. The payoffs of their simultaneous 
interaction on the consumer market are: 
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If min{x,y} > max{r, s} then Mac for both or Bin for both are Nash equilibria while if 
max{je,y} < min {r,s} then asymmetric choices are Nash equilibria including Apple pro- 
ducing a Bin-compatible product and Microsoft producing a Mac-compatible one. 

This example illustrates the fact that if profits are high for compatible lines then 
firms are lead to cooperate on any standard. At the opposite if firms benefit from being 
the sole supplier of a given standard then cooperation does not appear. The question 
we ask is under which condition the payoff of the above reduced matrix present one 
configuration or the other. The answer can be found only by digging into consumer 
choice as we already saw in Part E on differentiation but this time taking into account 
the effects of compatibility and networks. 



24.2.2 Complements 

Bundled goods are made of complementary components such as a computer operative 
system working with software, a game console using games cartridges or a movie player 
reading movies tapes. Let us study the competition among firms A and B who produce 
consoles running compatible games which come in respective numbers Na and Nb. 

For some historical reason (not modeled here), consumers have a more or less marked 
preference x for a brand which is distributed uniformly over the [0; 1] interval. The sur- 
plus of consumer x is U A = x\/TJa when adopting system A and U% = (1 - x)\/Nb when 
adopting system B (complementary services have decreasing return to scale). 

The indifferent consumer (cf Fig. 11.1) is x such that U A = U~; his location deter- 
mines the respective market shares or consoles and it is trivial to compute ^r - ^ = 

jf- r Hence if new games become available only for a brand, that brand's market share 



increases. It is also straightforward to see the effect of prices when the games supply is 
proportional to the expenditure on the console i.e., when N A = D A {Y - p A ) where Y is the 
consumer leisure budget. 6@ We obtain ^ = -r^- o D A = ov ~ PA — so that an increase in 

° x Y-p A n 2Y-p A -p B 

price p A reduces the market share D A and even more strongly the supply N A . 

We have here a direct and simple explanation of the recurrent and fierce price battle 
occurring in the markets for PC's and game consoles. 

24.2.3 Compatibility 

We study oligopolistic competition for the establishment of a standard like mobile vs. 
fixed phone or laptop vs. desktop computer. 

Let us assume that consumers suffer a disutility c of not consuming their preferred 
model, either A or B. Otherwise they value the customer base x, so the surplus of a type 
i consumer from product j is Uf = Xj -clj^t. If the intrinsic attachment to a particular 
brand is not too large (c < 1) then both brands can become the de-facto standard i.e., 
achieve 100% market share. Indeed if brand A is the standard, no B customer can find 
it profitable to switch (alone). Otherwise, when c> 1, it is a dominant strategy to adopt 
one's preferred brand whatever its actual market share; hence both brand achieve a 
positive market share. 

To analyze an equilibrium with coexistence of different standards we have to consider 
the share a of A-lovers. It is easy to see that D A = a and D B = 1 - a is an equilibrium if the 
switching cost is large enough: c> 1 - 2 max {a, I- a}. Indeed, when A-lovers consume A, 
none of them wants to change to B if a > 1 - a - c. In the remaining cases, the preferred 
brand of the majority imposes itself as the standard. 

Defining social welfare as U A D A + UbD b , it is clear that the efficient standard is the 
one preferred by the majority but the two-standards outcome may dominate both. As 
its welfare is a 2 + b 2 while that associated to standard A is a + b{\- c), incompatibility 
dominates A if c> 2a. Hence incompatibility is the best outcome if c> 2max{«, I- a} i.e., 
when switching cost is high and/or types are evenly distributed. 

Arguments of coordination through repeated interaction 7@ guarantee that clients will 
coordinate on the correct standard when both are Nash equilibria; thus no market fail- 
ure occurs. There is a market failure if incompatibility is the Nash outcome and it is an 
inefficient one i.e., if ^ < max{«, 1 - a} < |. To conclude, we can say that according to the 
parameters values there are inefficient equilibria, efficient ones, displaying standardiza- 
tion or incompatibility. 

Network effects have also an influence on preemption strategies when for instance 
each customer's valuation of a product grows with the number of others adopting the 



product. In this case, an incumbent can profit from aggressive pricing to prevent entry, 
because the present losses are recouped later in the form of large profits derived from 
the larger base of captive customers. This is especially true if the prevention of entry 
encourages standardization on the incumbent's product and thereby lessens subsequent 
risks of entry (think of free software and M$). 

In the late 1980s MS bundled its software "Word" and "Excel" at a price slightly in 
excess of its leading product "Word" alone to gain the critical mass against "Lotus" and 
"Quattro". Then in the 1990s MS did the same with the Powerpoint program but this 
time in a preemptive manner to prevent the entry of another presentation program. 

24.2.4 Adoption of a Standard 

Intuition 

When looking at the speed of adoption of a given technology, whether a standard or an 
innovation, the percentage of users plotted against time always displays an S shape. 
Take-off is very slow because consumers fear embarking on a dead-end so that early 
adopters are risk loving people. Later on, there is moment where adoption accelerates 
because everybody is "jumping on the bandwagon" being eventually convinced by the 
usefulness of the standard and the fact that the more people use it the more valuable it 
is. The last phase of slower adoption signals that the product has reached its potential. 
As intuition suggests, the absence of coordination delays adoption because the optimal 
individual behavior has a free rider flavor: "it's better to wait until others switch, before 
doing it myself"; excess inertia could thus appear (cf §2.4.4). 

Farrell and Saloner (1985), using the famous backward induction reasoning, show 
this won't happen in a complete information world: if everybody has already switched 
then it is dominant strategy to do so for the last consumer. Then the penultimate con- 
sumer, conditional on the previous ones having switched, will also find it a dominant 
strategy to switch given that its own move will be followed by a last switch. This way 
everybody sequentially switches and the new standard is readily adopted. 

Obviously, this line of proof breaks down under incomplete information because no 
one can be sure that its own move will be followed by more switches. These authors 
provide us with a model of adoption of a new technology in a world where firms know 
precisely how much they would pay to adopt e.g., they know how the switch will be 
received by their clients. What they ignore is how much would their challengers pay, 
that is to say if they are more or less impatient to switch. 



Model f 

Each firm's willingness to adopt the new technology depends on a private information 
parameter 0. From the point of view of challengers, is uniformly drawn from [0;1]. 
Since there are 2 firms able to make 2 choices each, we are left with 4 situations. Assume 
then that a firm's profit as a function of her own parameter is 
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as shown on Figure 24.2. 

To inquire how fast firms adopt the new technology we use a two periods model of 
adoption among two firms. Assuming irreversibility, the available strategies are: 

• s\ : never switch 

• S2'. wait and switch if the other did 

• s 3 : switch immediately 

• s 4 : wait and switch if the other did not 

• S5: wait and always switch 



The important properties of these benefit functions are the existence of a positive 
network effect for each technology {B°\ d < B? ld and B n f w < B? e ™,), people with low 
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0's will play s x no-matter what (5^(0) = -3 < -1 = fl°jjj nc (0)) and people with high 0's 
will play S3 (B™™ (1) = 4 > = B? ld th (D). Note that S4 and S5 are dominated by s\ and 52 
respectively. 
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Figure 24.2: The Bandwagon Effect 



We show that there is a Bayesian equilibrium where firms with an intermediate will- 
ingness to change decide to "jump on the bandwagon", 8@ that is to say to adopt the new 
technology once the other has adopted (s 2 ). Technically the whole equilibrium strategy 

is 

s 1 if9<9 

o{6) = { s 2 otherwise 

s 3 if0>0 

where the thresholds satisfy 9_< 9. We now compute the payoff associated to S\, s 2 and s 3 
when the opponent follows a. We have: 

nm = eB° b l f th m + a-e)B°J l d one {6) = -a-9) 
n 2 m = e_B° b l o d th m + a- mzf h (0) = a-§) m - 3) 
n 3 (0) = 9B n a Z e (e) + (i-e)B n b e j h (9) = 8d-3-e 

The threshold 9 is part of an equilibrium only if the firm with type 9 is indifferent 
between strategies s\ and s 2 i.e., Ui(0) = n 2 (0) o§_=\. Likewise 9 is part of an equilibrium 
only if the firm with type 9 is indifferent between strategies s 3 and s 2 i.e., U 2 {9) = U 3 {9) o 
9 = |. We can conclude that 

I Equilibrium behavior leads to excessive inertia because firms fear to adopt alone and 
thereby lose the benefit of the network externality. 

Observe indeed that it is efficient to have both firm adopting whenever 

*»0i) + b»*2) > 2 0) o ^ > e* = I 

(cf Fig. 24.2) while adoption is delayed by at least one of the firm if either ^or 9 2 is 
lesser than 9. 

It can be shown that 

I Equilibrium behavior leads to excessive inertia because firms fear to adopt alone and 
thereby lose the benefit of the network externality. 

Farrell and Saloner (1985) project the result of their model to the political arena. On 
a controversial question, staunch supporters or opponents always commit themselves 
without waiting to see whether their stance become the popular view. Professional (wiser 
?) politicians often wait awhile to test the political waters, declaring themselves to be 
"for" the measure if the bandwagon begins to roll and "against" otherwise. 



24.2.5 On Deregulation 

In the US, the deregulation of the telephone sector took two steps. Firstly the ATT mono- 
poly was broken into one long-distance company and several regional local operators; 
secondly the long-distance market was opened to competition. Economides (1999) points 
at a neglected network effect: the total quality of a phone call is the minimum of sound 
quality on the two local lines and the long-distance one. This author claims that break- 
ing a monopoly telephone company into local and long-distance companies can reduce the 
overall quality of the service because the service is made up of complementary elements. 
Although the new firms will coordinate their quality levels, the double marginalization 
(cf §14.1.3) lowers individual profit margins and thus lowers quality choices. 

Consider the market for good AB with components A and B respectively produced by 
firm A and B. The final quality of good AB is q = min J^^b} where q A and qs are the 
qualities chosen by firms ex-ante. The utility of consumers is u{x) =xq-p where p is the 
price of good AB and the type x is uniformly distributed in [0;1] . Assume zero marginal 

a 2 

cost of production and total fixed cost <p A {qA) + <Pb^b) where (p^qO = c/y- for i = A,B. In 
the present situation there are no increasing returns to scale, hence no potential benefit 
to integrate the two firms. 

Consider first the behavior of the integrated monopolist. Since the sales for price 
p are 1 - £, the optimal price is p M = f so that the optimal final quality q maximizes 
U M {q) = | - <p A {qA) - (p B iqB)- The optimal qualities are thus q A = qB = q M = 4(c \ c ) and the 
total profit is U M = ~~r^-- r r . 

After deregulation firm A and B sell their component at prices p A and p B - Sales of 
both goods are identical and equal to 1 va+pb Assuming that prices are chosen si- 
multaneously we obtain a variant of the Hotelling model. The best replies are therefore 
Pa = ^p- and pb = ^-^ leading to the equilibrium, p% = p% = | and most notably to the 
final deregulated price p D = -%■ > p M i.e., a lower market coverage. We obtain here the 
inefficiency of double marginalization identified by Cournot (cf. §14.1.3). The remaining 
question is wether the consequence on quality choices is also negative. 

Ex-ante, the best reply of a firm to its competitor cannot be a higher quality since it 
does not result in any advantage, only costs. The profit of firm A being min W^'W _(p^q A ^ 
the best reply to qs is minj ^-,qB\- The marginal benefit factor of quality has dropped 
from | for an integrated monopoly to |, thereby generating a lesser investment into 
quality. This problem is also know as hold-up (cf. §14.2) because the necessity to bundle 
A's component to B's puts firm A at the mercy of firm fi's opportunism (it does indeed 
appear at the price stage). Firm A feeling that some of the rents of a higher quality are 
hold-up by firm B is lead to under invest. 



The choice of qualities has a unique equilibrium 

d d . j 1 1 1 1 
c\a - Qn =min^ , > < . 

The difference is minimized at =|- when ca = cb = c. In this symmetric case, total profits 
are n^ + rig = ^ < ^ = U M . The consumer surpluses are respectively S M = ^ and the 
much lower S D = j^. 

Observe that the utility function min {q a, cjb} is the limit of a constant elasticity of 
substitution (CES) function as the elasticity of substitution tends to zero. Hence the 
previous results hold for goods whose components are highly complementary. 

24.2.6 Network and Competition 

Since the interaction between a consumer and its service provider lasts several periods, 
we may start to care for the size of our network in the future because we anticipate 
a greater level of service satisfaction. The general impact of this network effect is to 
intensify competition because consumers are more valuable inso far as the network effect 
turns them into prisoners of the firm i.e., they are unlikely to leave a large network to a 
smaller one. 

We build on Hotelling's model of competition seen in §5.2.2. The specific addition is 
that people display an additional WTP p per fraction of the entire population present in 
their network. The utility from staying with firm A in the second period is u x {pa,cja) - 
v + hqa -tx-pA where qA is the first period market share of firm A; a symmetric formula 
holds for firm B. We assume full market coverage, so that qA + qB - 1. Working out the 
indifferent consumer yields demand D A - PB P ^ t+ ^ A where p A - p{2q A - 1). The equilib- 
rium is then p A = t+ ^-, as in the standard asymmetric setting and profit is \- t p 2 A , after 
simplifications. 

The full profit over both period, as a function of first period price pa is then tia = 
Pa^a + YtP\- T ne FOC is = qA - ^ [Pa + ^ (1 + ff) J which simplifies into | = ^{p + -£■) at 
the symmetric equilibrium where qA-\- The equilibrium price is then p* = t--jr which 
is smaller than in the standard case due to the exacerbated competition caused by the 
value of catching valuable consumer first. 

24.3 Critical Mass 

Positive network externalities create increasing returns to scale for firms which often 
give rise to extreme market structure where a single standard remains. It is thus impor- 



tant for each contender to be the first one to reach the critical mass, if not to dominate, 
at least to survive. 

24.3.1 Pros and Cons of joining a Standard 

In the late 1990s two technical hardware specifications for the DVD were competing to 
establish themselves as a standard, the single layer and the double layer technologies. 
A few years later, once this was settled, there was another battle among two techni- 
cal software specifications for rewritable DVD, the so-called "DVD+R" and "DVD+RW". 
Most manufacturers of consumer electronics and computers choose to support only one 
standard. In 2004, the battle rages over the next generation DVD between the so-called 
"blu-ray" and "hd-dvd" formats. 

The decision to join a given alliance is guided by three effects, private cost, social 
cost and competition. On the one hand one wishes to adhere to the standard best suited 
to one's own technology but on the other hand it is important to embark on the largest 
alliance because it is more likely to produce learning gains and positive feedback. Yet 
being more numerous to share the same standard is a guarantee for tougher market 
competition in the future. It is therefore not trivial to assess the right decision. The 
following model tries to give hints. 

Assume that the n potential adopter firms are differentiated by their innate prefer- 
ence 6t • e [0; 1] among the two standards called A and B. The network effect is captured as 
follows, let riA and hb be the number of member of each alliance then the marginal cost 
is Ci = c-n.A{l-di) for a member i of alliance A and Cj = c-nsdj for a member j of alliance 
B. 

The decision to leave one alliance to join another affects one's marginal cost in two 
ways, directly through the innate proximity to standards and indirectly through the 
network effect. Recall now the n firm Oligopoly Cournot model studied in §5.1.3 and 
in particular eq. 5.14 for asymmetric constant marginal cost c/. We found that the 
equilibrium individual sales were 

* a-b{nc i -'Lj^c j ) 

qi= WTi 

Since the equilibrium profit of firm i is proportional to {q*) 2 , what matters for a member 
of alliance A is to minimize nct-Y,i^i ci or equivalently to maximize 

n A a-0i) ^L_ 



n 



In this formula, n A {l -dfi is the product of the private technology parameter by the size of 
one's actual network while the remaining term is the negative competition effect induced 
by the network affiliation of other firms. 

Likewise a member j of alliance B maximizes 

_ n A LkeA^-Ok) + n B L k ^J B e m 

n B u i 

1 n 

hence the decision for i to leave alliance A to join B is profitable if 

nn A {l - di) - n A Y,kEA,k?i (1 ~ 0fc) ~ «B LmeB 9 m 

< n(n B + 1)9 1 - (n A - 1) LkeAk^ 1 ~ e k) ~ (n B + 1) L m eB,m^i dm 

or 

nn A {l-6i)-n{n-n A + l)6i< £ Cl-^t)- L 6 m = n A -l-n6 + di 

kEA,k^i meB,m^i 

where = hY.i&i is the mean of the 6 distribution. Hence, rearranging, firm i remains 
in alliance A if 6 t < Wj4( "^ ) w + + \ + " e - ■ It is quite immediate to see that low 0's all belong to 
alliance A while large 9's belong to alliance B hence the last member of alliance A is the 
largest integer i satisfying 

6i<- — t 1 

n A + n + 1 

which shows that the dispersion of all firms along the axis of innate preference among 
standards determines the final size of alliances. If for example the 6>'s are evenly dis- 
tributed with di = ^j- then the size of alliance A is ^ which is approximately half of the 
industry. 

24.3.2 Consumption Externality 

We adapt Economides and Himmelberg (1995) 9@ to show how the presence of a network 
externality in consumption changes the usual conclusions of the neoclassical theory. 

Introduction 

The most striking feature is that there is no small size market in equilibrium, a critical 
mass of consumer must subscribe to the service for its existence to be sustainable. Fur- 
thermore this property is mostly independent of the market structure. We shall see that 
the network externality introduces discontinuities or phenomena that might recall chaos 
theory in the sense that a small change of an underlying parameter triggers a strong and 
far reaching reaction. 



We model the telephone service using bits of models seen for horizontal and vertical 
differentiation. The willingness to pay for a phone call is Sx where S is the total quality 
of the service and x £ [0; 1] captures the heterogeneity of consumers. Total quality is made 
up of an intrinsic component s > and a network component, the expected subscribers 
base b. For a price p, the last subscriber will be x such that (s + b)x = p so that the 
aggregate demand is q = 1 - x. We can therefore express the price as a function P{q,b) = 
{s+b){l-q) of both the actual demand n and the expected one b with P q = -s-b (traditional 
negative price effect) and Pt = l-q (positive network effect). 




Figure 24.3: Network Effect 



For normal goods, demand always slopes downward but for network goods, the will- 
ingness to pay for the last unit increases as the number expected to be sold increases. If 
expected sales rises with actual sales, then the willingness to pay for the last unit may 
increase with the number of units sold. To see this we use the equilibrium condition stat- 
ing that expectations are to be fulfilled (b = q); the demand q thus solves P{q, q) = p hence 
D{p) = | (l - s+ ^s{s + 2) + 1 -4p) as plotted on Figure 24.3 (the smaller root is unstable: 
a small increase in customers raises the utility of a few more people who then subscribe 
and convince even more people to do so). As can be observed from the figure, the market 
is either void (large price) or reach at least the critical mass q° that maximizes P{q, q). l0@ 
The impossibility to get a small market coverage stems from the vicious circle according 
to which many consumers are not interested in subscribing because the installed base is 
too small, and the installed base is too small because an insufficiently small number of 
consumers have subscribed. 



Equilibrium Demand 

In a perfectly competitive market the equilibrium is characterized by p = c, hence q pc = 
D{c). 

Turning to efficiency we observe that if b units are sold, the net consumer sur- 
plus of unit q < b is P{q, b) - c thus the total surplus of consuming b units is W[b) = 
f {P{q,b)-c)dq. For the same reason as before, the efficient subscriber base is the 
largest root 11@ q* of the equation W' = and since W'{b) = P-c + f PUq, b)dq > P-c we 
deduce that q* > q pc i.e., the first theorem of the welfare does not hold because competing 
firms do not account for the positive externality in their profit maximization. 

We now inquire how the exercise of market power is affected by the presence of the 
network externality. As seen on Figure 24.3, the marginal revenue P + qj£ - P + qPq + qPb 
is greater than P for small sales, equal for sales of exactly q° and lesser when the demand 
has the traditional shape. The optimal price is nevertheless found without ambiguity as 
the largest root 12@ n M of P + q^fi = c because the smaller root corresponds to a deficit 
(p < c). By the same token we see that if c> c°, the monopoly does not supply this market 
because it would lead to losses; otherwise it supplies q M < q pc by quoting P{q M ,q M ). 

Telecommunication 

Phone, SMS, email are obvious examples of services with a high positive network ex- 
ternality. For each of them the market can be divided among high and low valuation 
consumers (e.g., firms and households) 

Consider an equal number n of both types; the individual surplus is SiB - p where B 
is the consumer base, p the price of the service and s ( the willingness to pay (sh > s/). 
Assume 2s/ < %, then 

- if p > Shn, then nobody connects. 

- if p e [2sin;shri], then only firms get connected because the network effect is too small 
(even if all households connect the price remains too large). 

- if p < 2S/n, everybody connects to the service. 

The ratio pish can therefore be interpreted as the critical mass of firms that need be 
convinced of the service values in order not to be deceived later on. If the ratio Sh/si is 
large then a non regulated monopoly would choose not to serve households; this may 
explain why PTT is either a public service or a regulated activity in most countries. 



24.3.3 Big Push 

In this section, pecuniary externalities as opposed to purely technological ones 13@ are 
shown to matter for economic development. 

Context Ever since the industrial revolution, advanced economies have followed a sim- 
ilar developmental path whereby most sectors industrialize simultaneously i.e., massive 
investments into large scale production take place within a decade. Such a move is gen- 
erally unprofitable for a single firm operating in a backward economy because fixed cost 
are too high and/or the output market is too small given the low wealth level. However, 
if all sectors develop at the same time, typically in large cities, fixed cost items become 
cheaper because of closeness. Furthermore, industrial salaries increase (due to scarcity 
or union pressure) giving rise to a middle class that can afford the new products. This 
demand pull, in turn, rationalizes for each firm the decision to invest into large scale 
production. It thus appear that the same economy can reach equilibrium with or with- 
out industrialization because the latter requires coordinated move to become profitable. 
Murphy et al. (1989) formalize this "big push" theory of industrialization. 

Model The supply side of the economy features an obsolete technology (e.g., agricul- 
ture, handicraft) transforming labour into output on a one-to-one basis (after normal- 
izing units) and a modern technology (e.g., industry, services) with marginal cost c < 1 
(i.e., labour productivity \ > 1) but only if the fixed cost F (measured in labour units) is 
sunk. The choice for entrepreneurs is thus between constant and increasing returns to 
scale. Potential Bertrand competition among modern entrepreneurs leaves room only 
for a monopoly. 14@ At the other extreme, the obsolete technology can be operated at any 
scale by anyone so that this sector forms a competitive fringe pricing at marginal cost. 
Taking the labour unit as money, wage is unitary. An obsolete firm earns zero profit 
while a modern firm producing q earns n = q{p - c) - F . An entrepreneur is willing to 
create a modern business if he can get enough demand to cover his fixed cost out of his 
producer surplus; formally n>0o q> q = -^, the minimum efficient scale. 

There is a continuum (size L) of consumers/savers who can either work for the modern 
industry (inelastic supply) or operate the obsolete technology. Their aggregate income m 
is total labor earnings L (at unitary wage) plus aggregate firm profits (over the two tech- 
nologies). The driver of the model is the assumption of a unit continuum of product 
varieties upon which consumers hold identical Cobb-Douglas preferences. When charg- 
ing p < 1, the industrial monopolist of a given variety has demand ^ hence variable 
profit m(l - £ ) so that he wishes to increase his price. However, the competitive fringe 
forces him to adopt the unitary price. Hence, whether a sector (variety) industrializes or 



not, the equilibrium price is unitary. This in turn means that the aggregate income m 
is equally shared among the varieties i.e., each receives demand q = m. 15@ If a fraction 
A of entrepreneurs develops the modern technology, aggregate income is m = L + An. De- 
veloping profit as a function of output, we obtain q = m = yijM^a an d the entry condition 
becomes independent of A as it reads q > q o izm^F) > T^c ° F < ^ _ c " sL - 

This means that the (unique) equilibrium is either full industrialization or none. The 
outcome thus depends on the fixed cost F or the market size (proxied by L) as well as 
whether the innovation is drastic (low c). 

Multiple equilibria Multiple equilibria appear only if the industry wage is exoge- 
nously maintained above the rural one at w > 1 (as we show hereafter). This premium 
may be a compensation for disutility of work, moving to town or the result of union pres- 
sure made possible by the life in the city. Alternatively, once everyone lives in the city, 
the fixed cost of establishing a modern firm is diminished because all required services 
are closer. At the other extreme, innovating in the country side is difficult (high fixed 
cost) because specialized talents do not reside on site but have to be brought in. 

No development remains be an equilibrium if (1 - c)L < wF i.e., if the fixed labour 
requirement, paid at city wages, is too expensive. Imagine now that everyone adopts the 
innovation, then the entire labour force gets paid w as if there was no city premium (w = 
1); this is akin to lowering the fixed cost. This opens the door for unanimous adoption to 
be an equilibrium. More precisely, if everyone works at a modern firm, output is q = ^-, 
thus profit is n = ^(1 - cw) - wF = ^ - wL = (1 "" )L " f . Multiple equilibria occur for 
^ < j<l-cw which is possible as soon as 1 < w < 1+ ~^ c+4c . 

Since two qualitatively different equilibria are possible, there is a role for government 
to try to force the better one by building expectations or jump start industrialization in 
all sectors aka the "big push". 

Infrastructure The previous intellectual construct can be applied to infrastructure 
building which has been recognized as a key ingredient to economic development. 

The economy enjoys two IRS sectors of similar size, say service with a low fixed cost 
and industry with a high fixed cost so that firms in the first sector generate twice more 
surplus. Assume then that aggregate surpluses of 4 and 2 (say bn$) are generated only 
if an infrastructure costing 5 is build. Industrialization of the service sector alone and 
appropriation of its surplus via discriminatory charging for the use of the infrastructure 
is not enough for financing (4 < 5) although construction would be efficient since total 
surplus is 6 > 4. If both sectors industrialize but the infrastructure operator cannot price 
discriminate among them (for whatever reason), it will raise only 16@ 2+2 = 4, go bankrupt 



and interrupt service; it may even anticipate this outcome and refuse to build (especially 
if he is risk averse). At a further round of anticipation, industrialists from both sectors 
may decide not to industrialize. There is thus a role for the country's business associa- 
tion or a large bank or even the government to try to design a financing scheme for the 
infrastructure that guarantees its viability. Otherwise, the country is kept in a so-called 
"poverty trap". 17@ Note finally the futility of building a magnificent harbor if local busi- 
nesses cannot engage into exports. Thus, authorities must also provide cheap finance 
for potential users of the infrastructure to help (convince) them to switch to the modern 
technology. This last issue is the complementarity of the inputs and services needed to 
allow an industry to develop. 

Banks The history of continental Europe saw large banks playing a coordinating role 
in financing industrialization. Da Rin and Hellmann (2002) rationalize this fact by show- 
ing how a bank with large market power may force the big push even when all actors 
hold pessimistic beliefs about the wave of industrialization. 

Assume to simplify that the profitability of a single firm is an increasing function n 
of the share of firms having industrialized and that n(0) < 1 + r < n{l) holds where r is 
the marginal cost of money (rate of refinance on international money markets) for the 
competitive banking fringe i.e., if no one industrializes it is not possible to finance the 
first one but if all do then all upgrades can be financed. There exists a large bank with 
a cost advantage (refinancing rate r < r) able to force unanimous adoption by offering 
extremely attractive financing conditions to early adopters. If there are n firms (ordered 
according to some accounting screening), firm #i is offered debt at rate min{r,r ; } where 
1 + r, i = Tt[ L ^-) (the competitive fringe must be kept at bay). If the belief is that only 
the first i - 1 firms adopt then they will indeed adopt since their loan is profitable but 
crucially, firm i will also adopt since her rate is designed so as to make her indifferent (or 
happy if her rate is r). This proves that only 100% adoption is a coherent (equilibrium) 
belief. Now, this peculiar strategy is profitable for the large bank if she can recover the 
losses from early adopters out of the profits she makes on later ones where her margin 
is r - r . The feasibility condition is thus dependent on this margin and her protected 
market share. 

We have here an instance where an oligopolistic structure is dynamically efficient 
because it alone can avoid the pessimism that would keep the economy in a poverty trap. 
The key is that big banks accept losses early on because they can recoup them later on 
thanks to their maintained market power. 



24.3.4 Rent-Seeking 

In line with the previous model, Murphy et al. (1993) show that the potential existence 
of rent-seeking (cf. §7.2.1) creates indeterminateness i.e., an economy may reach either 
an efficient or inefficient equilibrium. If history has brought the economy in the bad 
equilibrium, a revolution rather than an evolution is necessary to break out and reach 
the good equilibrium. 

Imagine that individuals can engage into agriculture (A), business (B) or rent-seeking 
(R) with potential returns a, (3, y . Compared to activity A, B is less local, more export ori- 
ented, more intensive in capital (human and physical), more risky, operates at a greater 
scale and is potentially more profitable i.e., ft > a. However, all these characteristics 
make B an easier target for rent-seekers (compared to A). To simplify, we assume that 
only B can be plundered by an amount y < p. Private rent-seeking will be theft or fraud 
while public rent-seeking will be corruption for permits and licenses that business needs. 
In line with the paradox of exploitation (cf. §7.2.5), rent-seekers are assumed to secure 
a greater income than producers of basic staples, thus y > a. 

Let rii for i e {A,B,R} denotes the share of the population in each occupation. When 
n R = 0, the population structure settles at [n* B ,n* A ), a distribution obeying complex cul- 
tural and historical relationships not modeled here. Rich businessmen and poorer peas- 
ants earn respectively ft and a per capita. As peasants start to switch to rent-seeking, 
each earns the greater y. Given a distribution of the population among activities, the 
total income to activity B is pnB-yriR, thus the average income is jir = fi-xy where x=^f- 
is the ratio of rent-seekers to businessmen (our main variable of analysis). Meanwhile 
tib > a, more peasants flock into rent-seeking but for x > x = ^p, each businessman can 
threaten to revert to agriculture if his rent-seeker does not leave him at least a (so to 
speak). This means that the pie available to rent-seekers is no more than (/3-a)nB, 
so that their per capita income is jtr = ^-, a decreasing hyperbole. Let x = ^- > x be 
the solution of tir = a. Figure 24.4 graphs these per capita profits as a function of x; 
intersections between ttb and tir occur at x and x. 

There are two equilibria where no one desires to change occupation. One involves 
no rent-seeking. Although each peasant would like to become a rent-seeker, it is cheap 
to deter such a behavior when the phenomenon is rare. This is an efficient equilibrium 
where wealth creation is maximum. The second and inefficient equilibrium is x where 
there is so much rent-seeking that everybody earns the low secure income a. Lastly, x is 
not an equilibrium because there is already a significant proportion of rent-seekers and 
any small increase creates a larger wedge in returns that triggers more entry into the 
rent-seeking activity. 




Figure 24.4: Multiple Rent-seeking Equilibria 



24.3.5 Myths 

Rule of the Road 

In every country, the rule of the road compels driving on the same side of all roads. 
Contrary to popular belief, this choice is not a caprice of history but an optimal response 
to changing technological conditions. The driving factor is that most people are right 
handed and thus use their right arm to apply force or precision. The evolution of the 
devices they had to operate decided which side of the road was more convenient for 
travel. 

As for walking, most people appear to have a natural tendency to keep to the right. 
During the middle ages, transportation used animals lead by the right hand of a walking 
person who was then on the left side of the cart. The driver would keep to the right of the 
road in order to position himself towards the center and avoid collisions with incoming 
traffic. 18 ® 

Later on, wagons and passenger coaches were driven by a single man holding a whip 
in his right hand and sitting or standing at the back of the cart; he would sit on the 
right side to avoid hurting passengers with the whip. Having changed side of the cart, 
he needed to change side of the road and travel on the left in order to remain close to the 
center. 

At the same time, large distance transportation developed in the US and continental 
Europe and it made sense to use large wagons and stagecoach pulled by many horses 
where the driver would ride on a rear horse. Since the driver keeps the whip in the 
right hand, he must ride a left horse to use it properly and is thus lead to travel on 
the right side to remain close to the center of the road. This new custom was enforced 



on turnpikes (toll roads) and quickly became widespread in the entire US. The story 
for continental Europe follows a different kind of rationale. It is believed that Napoleon 
enforced walking on the right side for his soldiers because the widely used musket (riffle) 
was better aimed towards the left (as horsemen do with their lance in jousting). 19@ This 
new walking custom, agreeing with France's custom for wagons, was then imposed onto 
conquered countries i.e., most of continental Europe. 

Lately, some (surprisingly?) English authors have claimed that driving a car on the 
left side of the road provides with a greater level of security due to some asymmetries in 
the brain. Even if this advantage was noticeable, it would become increasingly irrelevant 
as more traffic takes place on protected highways. This illustrates the fact that the 
current equilibrium may not follow what the latest science might call for but the ups- 
and-downs of historical adaptations to seemingly unrelated technological changes. 

Video Cassette Recorder 

As recalled by Spulber (2002), the battle between Sony and JVC overs VCRs in the 1980s 
was a competition for market dominance where both tried to impose its standard (Beta 
and VHS respectively). This story is often presented as a path-dependent market failure 
where the bad standard, VHS, won (quite inexplicably) over the good one, Beta. The true 
story is a bit different. 

VCRs was invented by the US firm Ampex in 1956 and was marketed to profession- 
als during the 1960s. Ampex licenced its patents to several Japanese firms, among them 
Sony and Matsushita, in order to penetrate this market but all VCR models were in- 
compatible. In 1970 no less than five firms tried repeatedly to launch a VCR for home 
use. All failed but Matsushita learn that the playing time of its VCR was deemed too 
short by consumers. In 1975, Sony launched its Betamax with 1 hour playing time using 
compact tapes. In 1976, JVC, a subsidiary of Matsushita, introduced a similar machine 
with bigger tapes that allowed 2 hour playing time but less possibilities for editing, a 
key feature in the professional market. When Sony launched the Beta II with a 2 hours 
recording capability for the US market in 1977, it took only 6 weeks for JVC to launch a 
4 hour recording VCR. The latter feature proved crucial as VHS quickly outsold Beta. 

During the 1980s, both competitors made alliances to increase the market share of 
their standard in a very close timing. They both improved their technology and lowered 
their prices but the VHS sales advantage never vanished so that the installed base of 
VHS consumers grew much bigger than the Beta. This fight repeated in every single 
market with the same outcome, VHS outselling Beta. Still Beta, by permitting edit- 
ing, remained the de-facto standard of the professional broadcasting industry. This may 
be why it had a supposedly better picture quality but this was never confirmed, if not 



contradicted, by consumers reports. With respect to the alleged market failure we may 
conclude that there was no bad standard but two standards adapted to two markets, the 
professional and the individual ones. 

If the payoffs for compatibility had been higher then both would have agreed on one 
of the standard but this may have generated a high welfare loss. Indeed, the loser of the 
battle, Sony, has never lost money with Beta thanks to its leadership of the professional 
market; this has permitted and encouraged Sony to enter into a Bertrand competition 
against JVC. Both products have improved and their prices have fallen at a pace not 
observed for a protected good like Polaroid instantaneous photography. 

Microsoft 

Without judging upon the alleged monopolistic behavior of this company in the Internet 
era, the success of its spreadsheet and word processor deserve to be told. Before proceed- 
ing we must make a detour to the Apple world to understand Liebowitz and Margolis 
(1999)'s claim: Microsoft gambled on the graphical user interface (GUI) pioneered by 
Apple, spending time and money to learn well this technology, to later on launch the 
software that made its fortune. 

Back in 1984, Apple introduced the "Macintosh" operating system (OS); although 
nicer and easier to use than text-based OSes like DOS it was also more expensive (more 
costly components like memory and video display), exclusive (no licensing to competitors) 
and much slower since the graphical interface used a significant part of the processor 
resources (all processors were and still are roughly equally powerful). Business users, 
by far the largest market, love to crunch numbers and have always been repelled by the 
slowness of the "Mac" OS. This is the key explanation put forward by analysts to justify 
that this product never caught a large market share. At the same time, the graphical 
capabilities of the "Mac" are at the source of its success in graphical intensive industries 
(a similar dichotomy exists in the Beta vs. VHS case). 

We can now get back to Microsoft who like its competitors of the 1980s produced 
software not only for its own best selling "DOS" but also for the Macintosh. The "Excel" 
spreadsheet was introduced for the Mac (1985) before being ported to the "Windows" OS 
(1987). Likewise "MS Word" was launched for "DOS", then for the "Mac" and later for 
"Windows". 

The spreadsheet leader in the 1980s was "Lotus 1-2-3" who it self had outperformed 
the first historical leader in just one year thanks to its technological advances. In 1987, 
Borland launched "Quattro" and MS ported "Excel" to the IBM PC world with the then 
flaw that it needed the "Windows" OS to operate. The newer products were systemati- 
cally judged better than the leader, "Excel" being ranked first most of the times although 



it initially suffered from a sluggish responsiveness. Both market shares stayed between 
10 and 20% from 1988 until 1990 while Lotus managed to retain more than 50%. Then 
with the improvement of hardware and the "Windows" OS, the superiority of "Excel" be- 
came clear given that Lotus never successfully ported "1-2-3" to "Windows" (according 
to the professional reviewers). The economic results followed since MS's market share 
steadily gain to reach 70% by 1995. 

Liebowitz and Margolis (1999) reason that Lotus's technological failure to match "Ex- 
cel" capabilities is due to its immaturity with GUIs, more specifically because they never 
dare spend a $ into the Macintosh world. The latter attitude makes sense from a short 
term point of view since the Apple market was small thus not very profitable. This is 
why one may think of the Microsoft association with the Apple world as an investment 
for the future. 20@ In fact Lotus followed a similar path by investing into the development 
of a revolutionary spreadsheet for a very advanced OS, "NeXT". The fate of Lotus came 
from the complete failure of this OS to break up in the mass market, in due time. In a 
twist of history, NeXT happens to be the ancestor of the "Mac OS X" system that made 
the fortune of Apple. 

The market for word processors was fairly competitive back in 1986 with "WordStar" 
and "WordPerfect" on top. The latter subsequently gained market share to reach 50% by 
1990 and was recognized of highest quality in its "DOS" version by readers of computer 
magazines; however "WordPerfect" did a poor transition to "Windows" as attested by 
worse reviews. On the contrary Microsoft succeeded to increase its already good ratings 
for "DOS-Word" to top-of-the-class for its "WIN-Word". Adding the fact that "WordPer- 
fect" came into the "Windows" world 2 years after its Microsoft competitor, there is no 
surprise to see that the MS product gained a steady 10% of market share every year 
from its introduction in 1990 until 1998. 

The analysis of prices is also a good indicator of the competition that took place in 
these two markets in the 10 years period 1988-1997. For spreadsheets, the price of "1- 
2-3" dropped from 300$ to 20$, that of "Excel" dropped from 250$ to 60$ although they 
were quite similar most of the time. "Quattro" was a better bargain, starting at 50$, 
reaching 100$ before dropping to 20$. As for word processors, prices started around 
160$ and slightly rose until the introduction of "Windows 3" in 1990; thereafter a strong 
decline occurred. Microsoft priced "WIN-Word" like "Excel" from 250$ at its beginning 
in 1990 down to 60$ in 1997. Competitors that were once more expensive ended up at a 
bargain price. The success of Microsoft resided in the bundling of its two star products 
in the "Office" suite sold at an attractive price (less than the combination of independent 
software whether from MS or not). 

We cannot conclude this chapter without a word on the famous QWERTY contro- 



versy. This keyboard was developed to prevent the hammers on early mechanical type- 
writers from jamming; the letters configuration has carried over into the present day 
even though the initial reason for its arrangement no longer applies. It has been al- 
leged by many economists that the newer Dvorak Keyboard was superior and so that the 
lock-in of millions of users into the old standard was a clear example of inefficient path 
dependency. Liebowitz and Margolis (1990) demonstrate the fallacy in full detail; one 
detail deserves special mention: the only evidence of the Dvorak superiority are studies 
undertaken by the inventor Dvorak himself! 

24.4 Two sided platforms 

To be successful, a platform like a game console must offer many games to attract gamers 
but it also has to be popular among gamers to attract game developers; the owner must 
therefore address the celebrated "chicken-and-egg" problem to get "both sides of the mar- 
ket on board". Rochet and Tirole (2003) argue that most markets with network exter- 
nalities display such a two-sided feature and study the business model of the platform 
owner, that is how he/she courts each side while making money overall. 

24.4.1 Game consoles 

Rochet and Tirole (2003) report several failures of highly innovative platforms who pro- 
vided a better game experience to users but lacked good games to play with; the first 
generation XBox disappointing results are generally attributed to the limited number of 
available games. Strangely there are no reverse result. 

In this sector, the console developer such as Atari, Nintendo, Sega, Sony or Microsoft 
charges game creators a two-part tariff. Firstly the development kit that enables testing 
a game on the console is sold for a fixed fee. Later on, royalties are paid to the console 
manufacturer over the sales of games. The platform itself is often sold with a loss to end- 
users, it is called a loss-leader. 21@ For the launch of its XBox, Microsoft tried to appeal 
to game creators by offering an inexpensive development kit running on (cheap) PCs to 
free game developers of working on (expensive) workstations. 

Similar to game consoles, softwares designed to display web content (e.g., browser, 
media player, text viewer) are free of charge for consumers while the servers that gen- 
erate the content are sold to creative firms. Likewise most of the media make money on 
advertising and subsidize the price of the end-user product be it newspapers or TV. 



24.4.2 Portable Devices 

Interestingly, operating systems (OS) for the personal computer (PC) and handheld de- 
vices have adopted the opposite business model probably motivated by the success of 
Microsoft in applying it. An OS developer derives profits from consumers through the 
royalties hardware makers pay to get their machines running the OS (or directly as in 
the case of Palm and Apple). The loss for the OS developer comes from the obligation 
of making crucial code freely available years in advance to foster the development of 
applications running on the OS and make the package attractive to potential clients. 

Apple has successfully build a two sided platform for portable music. Late 2001, this 
company launched the iPod, a portable music player packing thousands of songs into a 
tiny and nicely designed shell. Despite being much more expensive than its competitors, 
this product succeeded to conquer a large market share by the end of 2002 thanks to 
its offer of useful features like compatibility with "MS windows", synchronized agenda 
(typical of digital organizers) and above all a very easy management of songs thanks 
to the integration with the iTunes software. The next move by Apple in May 2003 was 
to turn this software into an internet music store, bursting an instant success crowned 
by Time Magazine as "invention of the year". The reason behind the success is that 
the flexible Digital Rights Management (DRM) system negotiated with the major record 
labels enables iPod users to enjoy their legally bought music on the move. What older 
online music stores were missing was the integration of the software with the hardware 
that provides users with greater satisfaction, thus greater willingness to pay. 

Yet, Apple has just recognized that iTunes Music Store was a loss-leader due to the 
high level of royalties paid for DRM. There are two possible explanations; on the one 
hand, the service might become profitable in the future for the possible presence of scale 
economies and on the other hand, the software activity (iTunes) might just be a boost to 
the profitable hardware sales (iPod). Sales figures as well as the market value of Apple 
from late 2005 seem to point at the latter explanation. 

24.5 Social interaction 

Veblen (1899) and more recently Leibenstein (1955) and Becker (1974) have argued force- 
fully that consumption decisions are made to satisfy both material and social needs: the 
pleasure derived by consuming is affected by the consumption choice of other consumers. 
Vanity and Conformity (aka ) are two complementary explanations for such needs. 

In the same vain, Schelling (1969) analyzes location choices and shows that small 
perturbations such as the arrival of people of different characteristics can trigger moves 



that propagate and radically change the entire distribution up to the point where com- 
plete segregation results (neighborhoods only contain homogeneous people). 

24.5.1 Status seeking 

Conformity and Vanity 

To maintain cohesion and defend itself against aggressions every society has formed 
habits and customs together with the censure of nonconformist attitudes. As a result, 
individuals imitate each other because they feel the desire to avoid social ostracism. In 
this case, social interaction leads to conformity. Products like garments or beverages are 
very conformist especially in the population that most need social recognition, the youth. 
Another consequence of social conformity is the "winner takes it all" effect observed in 
entertainments, sports and the arts where the number one in sales or audience gets 
much more than its followers because he (she) is the object of a collective passion. 

On the other hand, human beings have individualistic values and look for prestige or 
social recognition. If they can afford it, they try to signal their idiosyncrasies by their 
consumption of positional goods that are better observable than their income (and not 
subject to taxation). Examples of luxury goods are perfumes, sport cars, "haute couture" 
or membership of selective clubs. Social interaction now leads to vanity. Veblen held the 
view that the lower classes were not out to overthrow the upper class as Marx believed 
but, rather, strived to climb up to it. It is quite straightforward to see that the desire to 
surpass the neighbor or least to keep up with the average leads to excessive consump- 
tion. 22@ Social status can be so important relative to intrinsic utility that individuals 
are rationally lead to adopt conformism as a standard of behavior, despite heterogeneous 
underlying preferences. 

Impact on firm behavior 

We study Grilo et al. (2001)'s adaptation of Hotelling's spatial model explaining firm' 
strategic pricing behavior when consumer preferences exhibit either conformity or van- 
ity. 

As in §11.1.2, we consider consumers uniformly distributed within the [0;1] segment. 
Stores A and B are located at distance %a and xb respectively; these figures can be both 
larger than 1 meaning that the farthest away may sells a product of inferior quality. The 
consumer with characteristic x, who buys from store i = A, B derives utility 

Ui{x,ni) = p-pi-t{x-Xi) 2 + an { 



where ani is the externality affecting store i with market share n ; . Vanity is present for 
a < and conformity if a > 0. The model yields interesting result only in the presence 
of an asymmetry among the two firms. Let us then assume that firm A has a locational 
advantage with xa + xb > 1 . 

The equilibrium is characterized by the indifferent consumer x for which ua(x, ha) = 
ub(x, n B ) and two additional market clearing equations ha = xn and n B = (1 - x)n stating 
that the expectation n t for store i corresponds to its actual demand, the product of market 
share by market size. Solving this system, we obtain 

Pa+ t{x- xa) 2 + ccxn = pb + t{x- xb) 2 + a{l- x)n 
=> 2tx(XB~XA) -2axn = pg- Pa+ t{x 2 B - x 2 A ) + axn 

Pb - Pa+ t [x\ - x 2 A ) - an 



x = 



2{t{xs- xa) -an) 



D A = x oc pb~ Pa + t{x B - x A )-an (24.1) 

Db = 1-x oc Pa~Pb - t {xb - xa) {2- xa~ xb) -an (24.2) 

To find out the price equilibrium, observe when there is no network effect i.e., a = 0, 
the threshold x is identical to x derived in eq. (11.2) of §11.1.2. Up to a multiplicative 
constant, the denominator, demand is D A = x- an, thus when maximizing the profit 
rU = {pa -c){x- an), we obtain the best reply Pa^Pb) - an/2 where pA is the no-network 
best reply function (eq. 11.3). It is as if the marginal cost c was inflated by an. The 
equilibrium is thus the standard Hotelling solution (11.5-11.8) minus the externality 
computed for the entire market. 23 ® 

We can compute the equilibrium market share with 

t{XB - Xa) (2 + xa + xb) -3an 

x- 

6{t{XB~ xa) -an) 

It is readily seen that if stores are symmetrically located with xa + xb = 1 then x = 1/2 and 
is therefore unresponsive to the strength of the externality or the market size. If store A 
has a location advantage then x > 1/2 increases with an. 

Vanity yields higher market prices while weak conformity generates lower prices. As 
the market expands or as the externality grows stronger, the firm with a location 
advantage loses market share under vanity but gains under weak conformity. 



24.5.2 Rationing 

Leisure Services 

No one likes to dine, drink, dance, listen, view or patronize any other social activity in 
an empty room; the larger the audience, we more we appreciate the act. Among reasons 
explaining this behavior, we get confirmation that our choice was good if others do the 
same and we appreciate to be seen or being able to claim presence at a "successful" social 
event. Social events therefore incorporate a positive network externality. 

Because of this positive externality, we tend to express demand at times where we be- 
lieve others are likely to do the same, hence strong demand gets concentrated at specific 
moments. Patterns of modern life also tend to amplify this phenomena since the leisure 
time gets limited by the standardized work schedule. We shall thus speak of a short 
lasting peak demand and a long lasting off-peak demand. Examples of peak periods are 
noon and evening for restaurants, week-ends for cultural events (movie, theater, concert, 
sport) and high season for holiday trips. 

Another characteristic of social events, albeit on the supply side, is the limited capac- 
ity to serve the clientele since attendance is roughly limited by the numbers of available 
seats. Allowing a greater number to attend generates congestion which is strongly re- 
sented by most participants; this is one of the main issue addressed in the next chapter. 
In a free market economy where prices tend to adjust to equate demand and supply, one 
would expect suppliers of social events to raise their prices at peak time so as to force a 
demand reduction down to their capacity (since this is profitable). Yet, prices tend to be 
sticky which often results in queuing or overcrowding. 24@ 

A technical reason is that demand varies a lot between peak and off-peak times, thus 
prices would have to be constantly adjusted which is extremely costly to do in all the 
places where prices are displayed on boards and menus. Furthermore, in order to dis- 
cover the market clearing price firms would have to perform auctions. This is what is 
actually taking place to resolve overbooking in planes: 25@ when there are more passen- 
gers waiting at the gate than available seats in the plane, an auction is conducted where 
a prize constantly growing is offered until enough people accept to board in a later plane 
or redeem their travel. 

A cultural reason is that consumers of social events believe they deserve stable prices 
so that any attempt to implement time dependent pricing is seen as gouging and back- 
fires into boycott. 



Resale 

Oddly enough, there are many events like concerts or sports where there exists a very 
active second hand market that no consumer resents. These markets are operated by 
brokers (legal entities) and scalpers (dodgy people on the streets) who are pure arbi- 
trageurs; they buy tickets at regular price in advance to resale them close to event time 
at the price that clear the market (since at that point their expense is a sunk cost). Bro- 
kers and scalpers are local intermediaries who have an informational advantage over the 
promoter regarding the evolution of local demand as the time of the event approaches 
(i.e., whether the event will be "in" or "out"). As shown in Courty (2003), this enables 
them to track the market clearing price and out-perform the promoter if he were to com- 
pete with them in the late sales market. Two consequences emerge: first, there is a 
profitable late sales market which explain why arbitrageurs appear in the first place, 
second, the promoter is bound to lose the competition against them which explain why 
arbitrageurs remain. 

Because promoters lose profitable late sales to resellers, they often lobby their law- 
makers for legal protection through prohibition or limitation of ticket resale. Another 
route against arbitrage is to personalize the emission as done by airlines although this 
is resented by consumers who see it as an undue introspection into their private matters 
and creates a value damaging inflexibility (you can't give away your ticket to a friend in 
case you won't be able to attend). 

Arbitrage is also reinforced by the underpricing of early tickets. Promoters do so for 
two reasons. The first motive is to generate the positive demand externality alluded to 
before. The second motive is risk management. As we saw in §19.3 on pricing under 
risk, a risky promoter tries to avoid loss making situations, here the case where tickets 
are overpriced and sell poorly. 

Because prices are sticky, there are some episodes of rationing at peak time i.e., why 
we have to queue at restaurants, disco clubs, theaters, cinemas or stadiums. 

Goods 

Producers of toys, microprocessors, game consoles and other fashionable electronic ap- 
pliances frequently resort to rationing when they launch a novel item whose qualities 
generates strong demand. Either a quantity known to be lesser than demand is brought 
to the market to force a "buyer frenzy" or sale is time limited to force "early purchase". 26 ® 
Raising the price when demand grows large is the efficient way to clear the market 
because it guarantees that the available units go to those who value them most. This 
adaptive behavior amount to price discriminate among peak and off-peak demand (cf 



§4 on price discrimination). By sticking to its original price and rationing part of her 
demand, a firm foregoes an immediate profit; some benefit must therefore accrue from 
the commitment to perform rationing and not raising prices. 

According to Gilbert and Klemperer (2000), this gain is identified by looking at the 
consequences of rationing. Those who suffer from it are the high-WTP individuals facing 
the prospect of being rationed when they would be assured to consume if price discrimi- 
nation was used. 27@ At the same time, low-WTP individuals that would never consume 
under price discrimination are sometimes lucky enough to get served under the rationing 
scheme. This last observation is at the root of rationing: the firm wants to give surplus 
to low-WTP individuals in times of peak demand to motivate them to become clients 
because this has a sunk cost. 

Consider indeed the decision to build a computer around a microprocessor, to buy a 
new game console or get a culture. The future client must incur an irreversible cost to 
learn how to use the good or service he will later buy. He will sunk that money only if 
he perceives a high enough future surplus from his consumption of the good or service. 
Due to the nature of the good or service under consideration this client, like any other, 
expresses revolving high and low demands. If the firm price discriminates, someone with 
low intrinsic interest in the item will be barred from consuming at times of high demand, 
he will therefore derive surplus from consuming at times of low demand. 

If the firm wants to lure that person into becoming her customer, she must compen- 
sate his sunk cost by lowering the off-peak price to generate enough surplus. This is a 
costly move because the discount is enjoyed by the entire customer base. Rationing can 
improve because it gives this marginal low-WTP individual the opportunity to consume 
at times where he most enjoys the item i.e., to derive a large surplus. The firm can then 
raise the off-peak price and avoid the aforementioned costly rebate. 

Model: consumer i e {H,L} has a valuation vt for the item with probability r ; and no 
interest for it otherwise. W.l.o.g., Vr > vv, we then assume that r# > ri i.e., high-WTP 
clients are more likely to be observed. 

24.5.3 Group behavior 

The previous models were micro foundations of how individual choices are influenced by 
their environment beyond the intrinsic qualities of the goods and services they consume. 

Segregation 

We present here Schelling (1969)'s model. Population is divided into n t incumbents (citi- 
zens of the local state) and n e entrants (immigrants) who share a preference for the city's 



central neighborhood. The dichotomy could apply to group membership for men/women, 
toddlers/teens, classic/vanguard, black/whites, gay/straight, christian/muslim or any other 
cultural trait. 

The next ingredient is to consider the heterogeneous acceptance of diversity i.e., each 
person subjectively tolerates a maximum mix of types (for the area); upon overshoot- 
ing that limit, he/she moves to another neighborhood, presumably, where his/her type 
dominates. Let assume that the tolerance index is uniformly distributed in each sub- 
population between (outright racists) and A (ultra-progressives). 

In each population of size n, let us order people by decreasing tolerance and relabel 
each by his rank from 1 to n. The number of people with tolerance greater than t is 
x = ^n, thus t{x) = A(l -xln) is the tolerance of individual x. If the most tolerant people 
in numbers Xt and x e live in the neighborhood, the marginal incumbent is going to stay 
if his tolerance £*(*»*) is greater than the mix ratio x e lxt. If he stays, so do all other more 
tolerant incumbents. The stability condition is thus x e < A{xi-x^/rii). By symmetry 
the stability condition for entrants is x ; - < X[x e - x 2 e ln e ). Taken with equality, these two 
conditions give rise to tolerance curves. 

The left panel of Figure 24.5 depicts them for equally sized populations (tit = n e ) and 
limited overall forbearance (A = 2) which means that the average person tolerates a one- 
to-one mix i.e., would agree to a 50/50 sharing of the neighborhood. Consider a pair 
[Xi,x e ), if it lies below the incumbent curve, the neighborhood is attractive to additional 
incumbents while if it lies above, the neighborhood is repulsive and forces some incum- 
bents to move out. The same applies with respect to the entrant's tolerance curve; we 
draw arrows to indicate in each zone how the current numbers of incumbents and en- 
trants changes when marginal people find out whether the neighborhood is attractive or 
repulsive to their group. 

An equilibrium is found at the intersection of the curves (diamond) where the tolerant 
halves of each population live together but this outcome is unstable since any small devi- 
ation triggers a process of adaptation that ends with complete segregation at either dot 
on the axis. Whether the neighborhood ends up filled by incumbents or entrants depends 
on the respective speeds of change in each group and on the historical distribution. 

Hope to see a mixed outcome is not lost however. The right panel of Figure 24.5 
uses a greater approval for diversity (A = 5) with the effect that three mixing outcomes 
are candidate equilibrium, one of which (middle star), is stable. Yet if the incumbent's 
population is twice the entrant's then the entrant curve shrinks down and the stable 
mixed outcome disappears, segregation follows again. 



When the overall degree of approval for diversity is large enough, a mixed outcome 
can become a stable equilibrium, otherwise complete segregation takes place and is 
determined by historical conditions. 




incumbents n; q incumbents 

Figure 24.5: Location equilibrium 



Rational rationing 

Becker (1991) reports having observed that two nearby restaurants offering similar ser- 
vices get vastly different demands. One is packed and has a long waiting queue while 
the other one is almost empty. This pattern can persist for a long time and strikingly 
the successful owner does not raise his price as standard microeconomic theory would 
predict. Alike phenomenon occur for the price of best-sellers books or top sport events. 
Eating out satisfies consumer's hunger but also their social needs (being "in"). In that 
case, the social status of the restaurant is its popularity. 

On Figure 24.6 we represent a traditional demand function, the marginal revenue 
function and the optimal price p* together with the corresponding sales q* that are 
lesser than the restaurant's capacity k. What the traditional demand theory neglects 
is that the individual demand depends positively on the expected attendance a at the 
restaurant. 

The effective demand of the restaurant is a function D{p,a) of price p and expected 
attendance a. In equilibrium, attendance is equal to demand thus D{p,a) = a which can 
be inverted into p = P{a). Differentiating, we obtain P'D p + D a = 1 



=> P' = K^ which is 



D n 



positive if the externality is strong enough i.e., when D a > 1 (recall that D p < remains 



true). The price can thus increase with the expected occupation but such a phenomenon 
cannot last forever so that the network effect D a will fall back if the expected occupa- 
tion is too large. The price function is then bell-shaped and reaches a maximum at p* 
inducing a demand a* . 




Figure 24.6: The Restaurant Puzzle 

Let p be the market clearing price i.e., solving D{p,k) = k. The maximum price p* 
generates queuing if a* > k o P' > o D a {p, k) > 1 o D{p, k + 1) > D{p, k) + 1 which means 
that, at the market clearing price, the demand increases by more than one table each 
time a new table is added to the restaurant. 

Assuming that this condition holds, the maximum price p* is optimal. A lower price 
entails at best the same effective demand but a lower margin, it is thus a dominated 
choice. Any higher price generates a zero effective demand because potential customers 
revise their expectations downward and start to leave the queue in front of the restau- 
rant (technically, for p > p* there is no a such that D{p, a) = a). 



24.5 .4 Social Cohesion 

Introduction 

In the business environments where cooperation is key to produce efficiently, the mod- 
els of labor discipline tell us that a rent has to be left to employees to induce obedient 
behavior (cf §20 on moral hazard). This view probably fits well the rather anonymous 
work relationship observed in advanced economies but stands in starck contrast with 
the business practices of ethnic communities whose members accept low wages for long 
periods without creating negative side effects on labor productivity or threatening their 
cohesion. Chinese, Jewish or Italian communities among others have always displayed 



this communitarism in the many countries where they migrated. A simple model of 
social network externalities can shade light on the economic forces behind their social 
organization (beyond the obvious cultural reasons). 

The idea is that paying a negative rent to workers enables the community firms to 
generate a greater cash flow so that new investments can be financed internally and 
rapidly in opposition to the lenghtier and dearer use of external funds. To sustain this 
apparently exploitative social organization, a carrot and a stick are necessary; the former 
is a siginificative probability of inheritating the business for people who have no initial 
rights on it while the second is ostracism for those who betray the social order. 

Model f 

Let us denote 6 the discount factor, to the wage inside a community firm, T the number 
of employees in a community firm and p the periodic return of the entrepreneur; the 
present value of being a community entrepreneur is thus V e = p ~_g . An obedient member 
of the community has a positive probability n of inheriting the business i.e., become en- 
trepreneur later on. 28@ Free-riding on the community while being a member is possible 
given the delegation level and yields an opportunistic wage a greater than the outside 
wage to; yet, since information flows perfectly in the community, the cheater is identified 
and fired which means his outside opportunities are reduced by an ostracism factor e. 
We assume p > a>to; the present value of cheating (c) is 

1-e 

V c = a + 6V = a + 6 -to (24.3) 

l-o 

while the present value of behaving honestly (h) is 

V h = to + 6(l-n)V h + 6nV e (24.4) 

thus 

to + 6nV e 6np + {1-6 -8nT)to 

Vh — — 

l-5(l-7r) (1- 8) (1- 8(1- n)) 

is an increasing function of insider wage to, thus 

(1 - 6) (1 - 5(1 - n)) [a + 8±E§co) - 8np 
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The minimum wage to_ necessary to sustain patient behaviour within the community 
increases with a, to and T but decreases with e, p and n (for the latercondition, V e > V c is 



assumed). Observe that in absence of network effects (n = e = 0), the inside wage has to 
be greater than the outside one since it reduces to o> = (1 - 5) V c = (1 - 8) a + 8U) > a). 

The insider wage w > co will have the appearance of exploitation if it is lesser than 
the outside wage &; this is possible only if the business is profitable enough: 

w<(b o 0)[6{.l-6{.l-n)){l-e)-l + 6 + 6nT]<6np-a{l-6){l-6{l-7i)) 
Xa) + aa-8)a-6{l-n)) 



p>p 



6n 



where X = 5 [I -5(1 - n)) (1 - e)-l+6+6nT can be positive or negative if the ostracism factor 
is large enough. 

Summarizing, we have shown in a simple agency relation that the expected value 
of being diligent increases with both insider wage and profitability of the business. To 
sustain diligent behavior within the firm over cheating and quitting, insider wage has 
to be large enough. However, if profitability is high, the insider wage may still appear 
exploitative i.e., be less than the outside wage. 



Chapter 25 
Network Congestion 



Service industries based on physical networks like the transportation of people and mer- 
chandise (road, rail, air, water), the transportation of energy (electricity, gas, oil) and 
communications (radio, TV, internet, telephone) display scale economies which give rise 
to a technological positive network externality: once available and consumed by some 
people, additional persons can enjoy them without increasing much the cost of provi- 
sion. 1@ In this context, natural monopolies are often the efficient market structure and 
regulation is called for to avoid abuse of market power (cf §17). On top of this, society 
views these as public services (cf. §16.1.1) which leads to choose pricing rules arbitrating 
between efficiency and fairness (preferential treatment for some classes of consumers). 

This chapter addresses the issue of optimal network size. Indeed, a regulated price 
close to marginal cost is presumably low, thus likely to generate a strong demand. At 
some point however, the physical network becomes saturated by excessive use, thereby 
generating a negative network externality called congestion. We shall firstly delve into 
the origins of congestion and explain why the law of demand fails to eliminate it. We then 
look at the intuitive response consisting in increasing the network size only to expose 
its hidden costs. Then, we present peak load pricing, the major theory addressing the 
problem together with a list of cases and their remedies. Whereas most of the chapter 
uses electricity as the underlying good to illustrate our subject, the last section turns to 
road congestion to broaden our point of view. 

25.1 Roots 

In this section, we take a close look at seasonality because this phenomenon is at the root 
of congestion in networks. We also introduce the notion of arbitrage as a force tending to 
mitigate seasonal congestion. 



25.1.1 Seasonality 

Seasonality or cyclicality in economic activities originates in nature and customs; the 
main historic forms are 

• Daily: farming takes place from dawn to sunset. 

• Weekly: Sunday for Christians, is devoted to worshiping the Lord and rest. 

• Yearly: the first people who could leave for vacations used to do it during the sum- 
mer to enjoy the better weather. 

• Weather: the wide changes in exterior conditions (e.g., heat, cold, humidity) drive 
our energy use, mostly destined at countering their effects. 

Many of these patterns have passed into regulation or norms to organize society's ac- 
tivity during day, week and year in order to achieve a better coordination both at work 
and at home. This collectivism makes herding a dominant strategy since everything is 
organized for those who follow the rules. For instance, 

• Night buses being fewer, it is more lengthy to go to one's night job. 

• Shopping on Sunday is difficult since most shops are closed. 

• The August slowdown-when most firms are closed for vacations-makes all eco- 
nomic activities dearer since the degree of competition is reduced. 

• Labour productivity is significantly lower under extreme temperatures. 

Breaking society's norm and choosing a path of action different from the majority is 
an expression of one's freedom in front of an alternative; 2 ® practically speaking it means 
to arbitrate or decide. Since this choice is not without consequences, we may say that 
there is an opportunity cost of arbitrage. We shall explain promptly the similarity with 
the well known concept of financial arbitrage. Table 25.1 summarizes our approach to 
cyclicality and the human reactions to it. 



Actor 
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Society 

Individual 
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Freedom 


Natural Seasonality 

Societal Seasonality 

Arbitrage 



Table 25.1: Seasonality and Arbitrage 



25.1.2 Congestion 

Seasonality of demand is rarely a problem for the supply of goods because they can be 
cheaply stored in advance. For instance, new cars or new electronic devices are produced 
during a few months and stocked in advance of the first day of sale; this way the manu- 
facturer can respond to demand if the product is a success and avoid to enrage rationed 
"would be" clients. 3@ When it comes to services, the instantaneous supply is limited 
by the currently installed capacity measured either by available employees (labour) or 
machines (capital). In labour intensive activities like retail or personal services (e.g., 
haircutting or catering), it is relatively cheap to meet demand peaks by recruiting more 
part-time employees. Transportation activities, on the contrary, are capital intensive. In 
these industries, the technological obsolescence and the necessary maintenance of cur- 
rent equipment makes it very costly to increase the installed capacity to the point that 
any peak of demand can be met. Hence these transportation systems are likely to suffer 
congestion during a small period of time, the peak, and extensive under-utilization dur- 
ing the remaining time, the off-peak period. Table 25.2 summarizes our classification. 
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Table 25.2: Impact of Congestion 



25.1.3 Arbitrage 

Almost a century ago, Pigou (1920) observed that taking the London subway at 8h in 
the turmoil was unpleasant while traveling on the same route at 15h could be enjoyable. 
Although the fare is the same, the quality of service is worse in the morning as if the 
journey was more expensive. There is usually an opportunity cost of congestion for the 
user either through lost time or worse quality of service i.e., he would agree to pay more 
to avoid the crowd at the crowded time. 

Since it is physically impossible to avoid the crowd at the peak time, one has to travel 
off-peak to avoid the crowd i.e., to arbitrate. However, switching from the peak to the 
off-peak period to travel may be difficult or, stated in economical parlance, expensive; 
there is thus an opportunity cost of arbitrage. 

These two opportunity costs are key to understand the relation between arbitrage 
and peak load pricing. If the cost of congestion is greater than the cost of arbitrage 
then it is worthwhile to switch consumption from peak to off-peak period. Yet, customs 



are strongly binding for most people which means that arbitrage is extremely costly for 
them; as a result few are naturally inclined to perform the direct arbitrage. 

If firms could serve whatever demand addressed to them, they would not mind this 
situation and would keep charging a single price for peak and off-peak service. Under- 
standing that their installed capacity is usually lesser than the peak demand, firms try 
to optimize the returns on their valuable assets. The trick is to retain those clients will- 
ing to pay more for the on-peak service and to retain exactly as many as the installed 
capacity enables to serve. The other clients who will be rationed (with respect to their 
first choice) are then compensated in order to foster their use the off-peak service and to 
lose their patronage altogether. The emergence of two different prices for the two periods 
is peak load pricing. 

Practically, the available capacity is auctioned to the users most eager to use it by 
increasing the wedge between the peak price ph (high demand) and the off-peak one pi 
(low demand) which, under traditional pricing, is nil. A consumer will arbitrate if his 
net utility of the service is greater during the off-peak period i.e., when 

utility cost of peak utility cost of off-peak 

peak congestion price off-peak arbitrage price 

, , . . „ I cost of off-peak cost of ] 

o Arbitrage if < + > } (25.1) 

I congestion discount arbitrage J 



For many services, arbitrage does naturally take place and, as we shall explain here- 
after, should be encouraged using peak load pricing to smooth out congestion. This re- 
duces the inefficiency or hidden cost thereby generated at society's level by the very 
existence of congestion. For instance, some people negotiate with their firm a shift of 
their workday by several hours to avoid traffic jams, other leave for vacation out of the 
season (and go to tropical countries where the weather is still fair) while some move 
their residence nearby their working place or look for a job nearer to their home. In 
many sectors and industries, auctions are used to sort out potential clients by their will- 
ingness to pay. Congestion in (regulated) transportation has become such a big problem 
that seasonal discrimination which was previously prohibited on grounds of fairness is 
now being introduced in more and more places like highways and city centers. 

For material goods, both opportunity costs in (25.1) tend to be very low. Indeed, there 
is almost no congestion cost for goods because firms can continuously adjust supply to 
demand (storage and transport are cheap). Next, the opportunity cost of arbitrage sums 
another low storage cost (cupboard or freezer) to the opportunity cost of money if the 
good is bought in advance (to arbitrate). These observation means that arbitrage is 



solely governed by the off-peak discount or, as buyers tend to see it, the peak overrating. 
Therefore, firms are prevented to take advantage of the higher peak demand and this 
is why luxury items like "foie gras", champagne or perfumes which are mostly bought 
around Christmas time do not suffer wide price variations during the year (cf §4 on 
price discrimination). One of the few exception is fresh sea food because freshness is a 
non storable attribute. 



25.2 Capacity Expansion 



In this section, we first show that congestion mitigation by capacity expansion has a hid- 
den cost, then we explain how road network effects can mare completely the supposedly 
obvious benefit of capacity expansion. We conclude with an application to the internet 
and communications. 

25.2.1 Hidden Cost 

Seasonality 

The customs and regulations that create the seasonality of demand generate an obvious 
hidden cost. Take for instance the economic activity in a major city; on working days, 
10 million people use the public transportation system (TPS) to commute to work while 
on week-ends nobody does. This makes 50 million trips a week, thus if these could be 
spread over the whole week, daily trips would fall to 50/7-7 million/day thereby gen- 
erating a 30% saving for the transportation system either through less investment and 
less time lost in traffic jams. Obviously, this reallocation of working time over the whole 
week would necessitate a revolutionary change of mind and organization in that society. 
This example nevertheless demonstrates that every seasonality generates a hidden ca- 
pacity cost; it occurs as soon as production of the service necessitates a costly physical 
equipment and demand for using the equipment displays seasonality. The ideal, cost 
minimizing solution, would be achieved if usage could be made constant over time. 

Price Discrimination 

Consider an airline facing strong demand during the summer for an holiday route. The 
investment into a new unit of capacity, an additional plane, will be launched only if 
the cost is lesser than the expected return, the latter being the product of a price by a 
quantity. The price is the highest one that a typical vacationeer would agree to pay while 
the quantity is the expected yearly number of holy-days. It is clear that if, for cultural 



reasons, strong demand is restricted to a few days, it won't pay for the firm to increase 
capacity; thus there will be congestion. 

Fitting the available capacity to demand is called rationing and there are many ways 
to apply it. Yet, the most profitable one is to increase the price up to the moment where 
demand shrinks to exactly the capacity. Tickets on a plane that serves only a holiday 
route during the summer are thus dearer than tickets on the regular all-year-long ser- 
vice on the same route. This discriminatory pricing behavior thus enables the firm to 
maximize revenue on the holiday route and will induce it to increase capacity on that 
route. Congestion will be reduced but never eliminated because each new unit serves 
additional clients with ever decreasing WTP, hence fetches lower and lower revenues 
(up to the point where the cost of the unit would cease to be covered). Underlying dis- 
criminatory pricing is the fact that the additional planes put for service on the holiday 
route enable the provision of a new service, holiday travel; this service is therefore priced 
differently from the original one. 

Nevertheless, such discriminating schemes are prohibited by regulators for services 
to the general public such as energy, water, buses or highways. Acting on ground of fair- 
ness and efficiency, unit rates are generally set at a low level so that the service receives 
an extremely large demand in peak periods. Since rationing is almost invariably viewed 
as a sign of underdevelopment (or a remembrance of war times), it is a politically unten- 
able situation. Legislators and the government therefore thus make it an obligation for 
the demand peak to be served by recurring to capacity building. The additional cost is 
then passed rather uniformly to consumers either in a lump sum way with an increase 
of the subscription or through a rate increase. Society thus ends-up paying a high bill 
for the service in order to indulge itself the pleasure of taking communion all together. 

Under the no-discrimination rule imposed by politicians to regulators, nothing is done 
to moderate the naturally large demand occurring at some specific and unfrequent mo- 
ments. Peak load pricing aims, on the contrary, at computing (and applying) the efficient 
scheme of price discrimination that equate marginal benefit to marginal cost. This is 
way an optimally sized capacity is build. 

25.2.2 Application to Communications 

We apply to internet, "the highway of information", a classic analysis from the economics 
of public goods. Browsing speed on the internet is sometime slow because the resource 
being almost free we tend to overuse it. Indeed, we only need to pay a small fee 4 ® per 
hour to surf anywhere on the web so that when a great event takes place, millions of 
users try to access the pages giving realtime information (images, sound or video) on the 



event; as a result, the server gets congested and the users have to wait a long time before 
receiving the response to their request. This excess demand is caused by the zero price 
of the service and generates a great loss of utility; it is therefore more efficient to set a 
positive price to the service even if its cost is zero in order to reduce the waiting time of 
the vast majority of users. 

Let us analyze this claim in a simple model. Consider potential users i = 1 to n, each 
characterized by the flow q% of information he transmits through the network (per hour). 
Denote Q = Y.i< n qi be the total amount transmitted and K the capacity of the network. 
We assume that each user displays decreasing returns to consumption and suffers from 
congestion in proportion of the delay he suffers which is measured by the ratio Q/K; we 
thus posit UiiqO = s/ql-pqi - cQIK where c> 0. The maximization of utility leads to the 
FOC, yjw ~ P + T ( s i nce Q = qi + tf-i)- If internet is free (p = 0), the optimal individual 
consumption is q° = K 2 /4c 2 leading to an aggregate of Q° = nK 2 /4c 2 . It is readily observed 
that total usage Q° increases with capacity K but decreases with the delay factor c. We 
notice that the congestion index Q/K is, in equilibrium, proportional to capacity by a 
factor nl4c 2 . A 10% increase in capacity will trigger such a large increase in demand 
that the congestion index will increase by the same 10%. This is a well known result 
according to which 

| An increase in transportation capacity ends-up creating even more congestion. 

The socially optimal use of the network maximizes the total utility minus the cost 
of providing the service which, for simplicity, we assume to be zero (it is quite small 
compared to the fixed cost of setting up the network). The objective is thus E ; <« ut = 
n [y/q- c^-). At the social optimum, the price is set equal to the marginal cost, here zero, 
hence the FOC is jj= = ^ and its solution is q* = -^j = q°/n 2 , leading to total usage Q* = 
Q°/n 2 . We may thus conclude that free internet is overused by a factor n 2 independently 
of the opportunity cost of delayed surfing c. Maximum welfare, equal here to consumer 
surplus, being W^ = ^-, society's willingness to invest into new capacity (internet server) 
is -. It is now natural to search for a price that decentralizes the optimum; we look for 
p* such that the individual FOC is satisfied at the efficient consumption q* i.e., 

1 . c „ 1 c (n-l)c 

= p + — o p = 



2yfq* K 2^/q* K K 

This optimal price increases with the number of potential users when the capacity is 
fixed. Using this optimal price, an increase in capacity enables a price reduction but still 
increases congestion since Q* IK is proportional to K. 

If access to the Internet event is managed by a monopoly, most often the owner of 
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the broadcasting rights, he shall set p to maximize U{p) = npd{p) where d{p) = ^-^ 
thus profit is proportional to ^ + 2 and maximum for p M = ~ which leads to total traffic 

Q M - jfr = %• The monopoly is aware that a high price reduces demand (as usual) but 
it is also cognizant that this indirectly reduces congestion, thus increases willingness to 
pay and generates a positive feedback on demand. This is why, in our simple example, 
the monopoly price rises more, leading to a monopoly demand that is only a quarter of 
the competitive demand (p = C m = is the competitive price) to be compared to one half 
in the absence of externalities. Yet, the monopoly production is still inefficiently large 
because the congestion externality is mostly ignored. 5 ® 

Observe also that since the monopoly profit is U M = f^, his willingness to invest 
into new capacity is y^; this level is proportional to the number of potential users and 
inversely proportional to their delay cost (which confirms basic intuition) but above all 
it is greater than the social willingness to invest which shall lead to over-investment 
because, in our simple model, users value capacity and are ready to pay for it. 

Although appealing, this model should not be over-stretched. No general conclusion 
can be drawn here because the changes with respect to traditional local equilibrium the- 
ory "a la Marshall" utterly depend on the linearity, concavity or convexity of the perceived 
cost of congestion about which we cannot say anything beforehand. 

25.2.3 Application to Air Transport 

Air traffic grows in Europe but delay growths faster. According to Eurocontrol, move- 
ments rose 4% per year in both 2004 and 2005 to reach more than 9 millions (per year) 
but the number of delayed flights rose to 40% and then 43%. Likewise the percentage 
of flights delayed by more than fifteen minutes rose to 18% and then 20%. The average 
delay per movement rose to 11 minutes increasing by about 8% every year which is about 
twice the increase in activity. 

Legacy airlines used to connect directly all served cities, leading to the so-called fully 
connected (FC) shape of their networks. Today, the Hub and Spoke (HS) 6@ network 
has become the standard shape; it was originally developed by FedEx for rapid freight 
delivery and by Delta Air Lines for passenger transportation. Brueckner (2004) offers an 
explanation for the dominance of this new organization based on the optimal behavior of 
a monopolist operating a triangle network where the top vertex can optionally serve as 
a Hub to connect indirectly the two lower vertices. The HS structure dominates the FC 
one if fuel cost and/or waiting time are low enough. 

We saw in §3.3.2 on quality choice by a monopolist that the profit over a route is 
n = qP{q,s) - C{s,q) where P{q,s) = ^fi- - - and C{q,s) = 0s + cq. Under FC, the cost of 



a single route is C(q\,s) where q\ is the demand for that route. Since all three routes 
receive the same demand, total cost are 3C{q\,s). Revenues are three times those of one 
route, thus profits under FC are n FC = 3n{qi,s). With an HS network, the cost of a single 
route is C{q\ + q2, s) where q\ is the demand for direct flights while q 2 is the demand for 
connected flights. Total cost, using only 2 routes for 3 cities, are thus 2C{q\ + q 2 ,s). The 
treatment of revenues for the HS firm is more complex than for the FC one. Revenues 
over the two direct routes are twice those of one route but revenues for the connected 
route are lesser because airplane time is twice and there is some waiting time at the 
hub. To model this fact, we assume that the WTP of consumers is diminish by a constant 
factor ix. Profits are thus 

n HS = 2q 1 P{q±,s) + q 2 (P{q2, s) - y) - 2c{q x + q 2 ) - 26 s 
The FOCs for quantity and quality in the FC network are 

T n 

a-bc- L = 2q 1 and q x = -r (25.2) 

s 7 

while the three FOCs for the HS network include the RHS of (25.2) together with 

a-\i-2bc — -2q 2 and q\ + -q 2 = -s 2 (25.3) 

The system (25.2) has a unique economically relevant solution [q FC , s FC ). Since the 
RHS of (25.2) applies in both situations, if we set initially s HS = s FS , then we must have 
q^ s = q FC . Now, assuming that 2q FC > 11+ be, the LHS of (25.3) yields a positive q^ s - 
Plugging this information into the RHS of (25.3) and comparing to the RHS of (25.2), we 
see that s HS > s FS must hold. Looking back into LHS of (25.2), we get q^ s > q FC . The 
increase in s HS also increases q^ & '. Since both HS quantities are greater than before, the 
solution of the RHS of (25.3) is an even greater s HS . This process converges because the 
changes are smaller and smaller. We have thus shown that adopting the HS structure 
involves higher flight frequency (s HS > s FS ) and larger traffic on direct routes (q^ s > q FC ). 

From the RHS of (25.2) applied to FC and HS, we get %? = [j^J > (^J since s HS > 

s FS , thus k HS = %g > %c = k FC - Since the k ratio is a proxy to plane size, the HS structure 
also involves bigger planes. Lastly, Brueckner (2004) shows that the HS network is 
conducive of greater profits i.e., more likely to be the optimal shape, if the parameters \i 
and c are small. 



25.3 Peak Load Pricing 

As compared to capacity expansion, peak load pricing is an alternative and more savvy 
way to equate demand and supply; it fosters consumer arbitrage and thereby increases 
the off-peak demand while decreasing the peak one; for that reason one also speak of 
"peak shaving" or "valley filling". We illustrate in details its functioning using the case 
of electricity. 

25.3.1 Variability and Seasonality 

Figure 25.1 displays the instantaneous demand for electricity in France, Italy and Spain 
over the day of greatest demand in 2003/2004. 7@ The graphs clearly show many vari- 
ations in demand during the day; they result from the living habits of households and 
from the legal pattern of working hours in businesses. There is however an important 
difference, the small peaks at 22h and 2h in France are absent from the other coun- 
tries; they are due to the extensive use of peak load pricing schemes by french industrial 
consumers. 

Non cyclical events such as a solar eclipse or a football match can also provoke large 
demand variations. On 11 August 1999, many people in the UK stopped their normal 
activities to watch the solar eclipse, the electricity demand fell sharply during 20 min- 
utes, as if 4 million consumers had suddenly disappeared. At the end of the solar eclipse, 
demand rose again back to normal in only 20 minutes. Likewise, when watching TV, we 
stop using other appliances and therefore reduce our electricity consumption. This last 
until the commercial break when everybody turn on the lights and goes to the kitchen to 
prepare coffee or tea. The resulting sharp increase in electricity demand is called a TV 
pick-up. 

Figure 25.2 displays the French demand over the year 2003 in two ways; the first 
graph simply records the weekly consumption and displays it according to calendar date. 
To understand its shape, it is enough to recall that electricity demand is inversely related 
to outdoor temperature and that France is dominated by Atlantic weather conditions. 8 ® 
The bottom graph, called the load duration curve, displays the same information except 
that levels of hourly demand are ranked according to decreasing demand. It is readily 
observed that to meet the maximum demand of that year, which is 83 GW, the utility has 
to maintain available a larger capacity, here 91 GW. The latter is itself lesser than the 
installed one, 116 GW, because some units are necessarily out of line for maintenance. 9 ® 

To see the hidden cost of excessive capacity, imagine that industrial demand can be 
coordinated by a central planner so as to become perfectly correlated to the opposite of 
the residential demand; this means that at any hour of the year, total demand is equal 
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Figure 25.1: Variability of Electricity Consumption 



to the average 53 GW. In suh a situation, the installed capacity could be halved to 58 GW 
without compromising security of supply (the fact that a unit can go down unexpectedly). 
Once we know that the cost of capacity is more than one half of the total cost of electricity 
the electricity bill for the country would be reduced by at least a quarter. 10@ That may 
be enough to negotiate with industrial users a smoother use and change this dream into 
reality. 

Not everything is lost in the current situation; thanks to the existence of interconnec- 
tions with neighboring countries, the excess of available capacity over internal demand 
can be used for exports. 11@ By widening the network, the variability of demand is re- 
duced since cold days in the north with heavy use of heating are fair ones in the south 
and hot days in the south with heavy use of air-conditioners are fair ones in the north; 
hence the need for extra capacity is reduced if the generated electricity can be trans- 
ported from north to south and vice versa to meet demand wherever it is located. The 
European integration of national electricity markets shall yield a hidden benefit by re- 
ducing the hidden cost of excess capacity. France provides a clear example; this country 
is at the moment the largest European exporter of electricity and is a net exporter at 
almost every hour of the year. However, during a few very cold hours on 1/3/05, net 
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Figure 25.2: Load Curves for Electricity Demand 

imports came from Germany and Spain to save the utility from having to start its most 
expensive generation units. 



25.3.2 Optimal Peak Load Pricing and Capacity Choice 

Problematic 

Peak load pricing is an old commercial practice akin to price discrimination that was ne- 
glected by academics until Boiteux (1949) made clear its relation to long term efficiency 
in the sector where congestion is most pressing: electricity. 12 ® 

Indeed, congestion in an electrical network is simply unsustainable; either the op- 
erator rations some consumers or the whole system fails i.e., a black-out occurs. Both 
black-outs and rationing have and continue to have a very negative political backlash; 
governments therefore always take care to instruct regulators to put reliability as the 
number one requirement of the integrated utility. Since these events are to be avoided at 
all costs, the utility is lead to build a very large transport capacity together with a very 



large generation capacity to be sure that any peak of demand anywhere in the service 
area can be securely met. 



Analysis 

When the demand for a service is constant or changes at a sufficiently slow pace to enable 
capacity adjustments, the relevant time frame for cost is the long-term (LT). Capacity is 
therefore set at the efficient production level equating willingness to pay and marginal 
LT-cost, the latter determining the price. The left panel on Figure 25.3 shows this for 
two demands labeled low (/) and high (h). 

If there are two periods of low and high demand that alternate faster than the time 
needed to adjust capacity each period will necessarily have to be treated from the short- 
term (ST) point of view i.e., the relevant cost function is the ST one. To analyze the 
setting of prices we shall assume that there is no arbitrage possibility so that the demand 
in each period remains independent of the price set for the other period. In each period, 
efficiency commands to price at the marginal ST-cost. 
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Figure 25.3: Flexible and Rigid Intermittent Demands 



We know from §2.1.3 that the ST and LT marginal cost coincide for just one quantity 
and since the ST function is more convex than the LT one, it is optimal to set the capacity 
equal to the peak demand (this is quite clear for the rigid technology on the right panel of 
Figure 25.3). Therefore, the peak quantity is the LT-efficient one, q* h , while the off-peak 
quantity is qf, the latter is only ST-efficient and greater than the LT-efficient level q* t . 
Indeed, since the installed capacity is greater than the off-peak demand (when priced at 
the ST-efficient level), production is artificially cheap. 



Interpretation 

We can now interpret the ST marginal cost at off-peak demand pi as a unit cost c of 
service for one period while the difference with the marginal cost at peak demand ph - pi 
is seen as the unit cost of capacity expansion; we denote it 2 r in order to interpret r as 
a per period cost of capacity expansion. A cycle sums the peak and off-peak periods; our 
notation makes clear that peak users are alone to finance the capacity since they pay c 
for their energy r for the extra capacity they need and another r for a capacity that is 
shared with off-peak users. As a matter of example, consider a business hotel for which 
the week-end is the off-peak period. The unit cost of providing a room for one night is 
the labour cost of cleaning the room everyday while the cost of making one more room 
available (capacity expansion) is the capital cost of construction plus the labour cost of 
employees that are present during the whole week (cycle). The same dichotomy applies 
for airplanes. 

For transportation activities such as highways or trains that are still regulated in 
some countries, governments have generally been wary of discriminatory pricing and 
have tended to impose a minimum fairness in the design of tariffs. Uniform pricing is 
in fact the only way to share the capacity cost 2rq h in proportion of group consumption. 
Indeed, one easily checks that the unique solution to the system 

nqi = 2rq h and r h q h = 2rq h 

qi + qh cji + cjh 

is n = r^ = q * 2r > r since the uniform price guarantees that final demands satisfy 
q\i> qi. Contrariwise to the efficient discriminating solution, uniform pricing is unfair 
with off-peak users since they pay a price for capacity greater than if peak demand did 
not exist. 

Examples 

Modern regulators now take a closer look at the hidden costs of capacity expansion and 
accept some degree of peak load pricing to eliminate unnecessary investments. The 
case of road transportation is treated in the next section. For water networks, true 
peak load pricing would require real time metering which would be extremely costly 
to implement. Instead, the rate structure takes advantage of the near simultaneity of 
individual loads to overcharge heavy users using an increasing marginal price. This is 
why in most countries where potable water is not plentiful, efficient discrimination is 
applied with increasing block tariffs (cubic meters become more and more expensive). 
One of the first peak load pricing scheme is probably Boiteux (1957)'s "green tariff" 



that was implemented the following year by EDF in France. Industrial users were of- 
fered a discounted price for most of the time in exchange of paying a higher price during 
the winter for peak hours. 13@ Today, in the retail market, beyond the basic contract offer- 
ing a constant price of 107 €/MWh, EDF proposes a "time-of-use" (TOU) contract where 
the night time price is reduced to 64 €/MWh. This tariff tends to smooth out consumption 
across hours within any given day 14@ 

The previous characterization of efficient peak load pricing was based on the inverse 
demand curves (willingness to pay). A competitive firm would obviously price peaks in 
exactly the same way while a monopoly or any firm with market power would set higher 
prices although following the same approach i.e., equates off-peak marginal revenue to 
cost of service and peak marginal revenue to cost of service plus the total cost of capacity. 

Lastly, we must comment the reversal problem. If the peak demand priced at LT mar- 
ginal cost happens to be lesser than the current capacity then we have excess capacity so 
that none should be replaced meanwhile obsolescence naturally reduces the working ca- 
pacity. Likewise, if the off-peak demand priced at the (expensive) LT marginal cost is still 
greater than the current capacity there is an urgent need to build more capacity. Finally, 
there is the case where peak load prices inverts the original ranking of demands; in that 
case the peak-off-peak wedge must be reduced (while maintaining capacity financing) 
until the two demands become equal. 

25.3.3 Demand Side Management 

Demand Side Management is the commercial name for direct strategies of peak shav- 
ing that are an alternative to peak load pricing. The problem with the latter is that it 
leaves the client take his own consumption decision. If the law of large number can be 
applied then occasional strange behaviors are averaged out; however, if the cost of arbi- 
trage suddenly increases then the peak price will be too low to motivate arbitrage. 15@ An 
example is electricity prices during summer days; usually clients avoid using much their 
air-conditioner because of the high cost but if there is a heat wave they will surrender 
and switch the apparatus to the maximum, at least for a few hours. Since everybody in 
the area will do the same, the law of large number won't apply and demand will explode 
which is precisely the worst situation for the utility. To limit load (i.e., quantity), a direct 
negotiation with large clients can be carried out: 

• peak clipping: an interruptible contract whereby the client, mostly industries, agrees 
to be cut-off on short notice (for a limited time and in exchange of a bargain price). 

• load shifting: a packages of prices and quantities whereby the client agrees in 
advance to limit load during the known periods of peak demand. 



Modern methods of consumption smoothing involve a more active participation of 
consumers thanks to the progress of telecommunications (IT revolution) which enables 
to notify clients either of a price increase or of a rationing (to which they have agreed). 
EDF's large clients for which interruptible contracts can be proposed account for one 
third of total electricity output (more than household share); hundreds have contracted 
one of the above formula. 

A different scheme of DSM in use at EDF since 1996 is the "tempo" contract; this 
peak pricing strongly motivates the client to reduce consumption on some crucial days 
to compensate for the load increase of more inelastic clients. The 365 days of the year 
are distributed into 3 classes as follows: 300 days blue (cheap), 43 white (regular) and 22 
red (expensive). EDF later decides every day whether the next day falls in any of the 3 
categories with some limits and up to the exhaustion of the available days in each cat- 
egory 16 ® Table 25.3 shows the various unit prices associated with this flexible contract 
together with some EDF cost and prices for various fuels; one sees how the prices in 
the flexible contract match the market prices of the technologies used to meet peaks of 
demand. 



category 


blueN 


blueD 


white N 
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redN 


red D 


# hours 


2400 


4800 


344 


668 


132 


396 


€/MWh 


44 


55 


91 


107 


168 


470 


1997 data 














Marg. Fuel 


Nuke 


Nat. Gas 


Nat. Gas 


Gas Turb. 


Gas Turb. 


Oil Turb. 


Av. Cost 


41 


40 


40 


98 


98 


152 


Price 


32 


43 


72 


85 


136 


363 



D: day time (6h-22h) N: night time (22h-6h) 
Table 25.3: Demand Side Management 

The overall effect of the DSM scheme is to smooth out consumption across days within 
the week. Thanks to these peak load pricing schemes, the demand addressed to EDF is 
less variable so that maximal available capacity can be reduced while keeping the same 
margins of security of supply. 



25.3.4 Nodal Pricing in Electricity f 

Purely competitive trading states that the price of Port wine in England is that in Portu- 
gal increased by transportation cost. Transportation in a network is however limited by 
its size characteristics e.g., the number of lanes on a highway, the section of a pipeline, 
the voltage of an electricity line, the number of optic fibers in an internet backbone. We 
will show in a simple model of deregulation the nature of this problem. 



Consider two countries like France (#1) and Italy (#2) that regularly trade electric- 
ity. Deregulation has taken place so that there is a competitive market in each country 
where generators and distribution companies bid demand and supply. The excess de- 
mand function for region i is Dt(pi), the aggregate demand minus the aggregate supply. 
It is a decreasing function thus, for a net import load qi, the equation Dtipi) = q\ has 
a unique solution p t = Piiqt) measuring the value (shadow price) of an additional unit 
of imports at the current level qi. If region i was disconnected from the other one, local 
production and consumption would equalize and the price would be p° = P ; (0) as shown 
on Figure 25.4. Now, when net imports reach the level qi, the net surplus of electricity 
generation and consumption, called the regional benefit is Wdpt, q{] = f^' {Ptix) - pi) dx. 

In each region there is a transmission operator whose mission is to maximize Wi{pi,qi), 
the regional benefit. Suppose France (#1) is a net exporter {q\ < 0) while Italy (#2) is a 
net importer (q 2 > 0). Since trade between the two regions is performed at the same 
price p and q\ + q 2 = in the absence of transmission losses, the social objective is to 
maximize Wi{p,-q) + W 2 {p,q) - f^ P 2 {x)dx- f® P\{x)dx. The efficient cross-border trade 
q* solves P 2 {q) = P\{-q) which defines p* as illustrated on Figure 25.4. This outcome can 
be easily obtained by opening national spot markets to firms of both countries; this way 
arbitrage opportunities will guarantee price convergence in France and Italy toward p* . 
The welfare so obtained is a maximum W* also called first-best. 




Figure 25.4: Electricity Trade 



The transmission line bridging the two countries has a thermal limit T that should 
not be exceeded without risking a complete failure; it therefore determines a maximum 
transport capacity. If the market equilibrium level of trade q* is larger than this limit, 
the transmission system operators must step in to resolve the imbalance by forcing a 
maximum trade of T units. Yet they still wish to maximize welfare i.e., they look for a 



rationing that reduces minimally welfare (from the first best level W*). The problem is 
that no single price can generate the desired reduction q* - T in demand for transmission. 
Indeed, if there are exactly T units scheduled for transmission then the willingness to 
import in Italy is p* 2 = P 2 {T) which is greater than the willingness to export in france, 
p* = Pi(-T) as can be seen on Figure 25.4. As a result, there are still profitable trading 
opportunities and traders will ask for more transmission service. 

Applying a rationing scheme like "first-in-first-served" is not a good idea because it 
is not necessarily those who value most the transmission service that will come first to 
reserve it. Neither is it efficient to curtail all trades proportionally by a factor 1 - T/q* 
in order to force the entire demand q* down to the capacity T. It would be more efficient 
to curtail entirely trades with low WTP and allow trades with high WTP. How then do 
we modify the market rules to implement the efficient rationing? Two methods can be 
used. The first one is nodal pricing whereby different prices are quoted at different 
locations. This scheme is used with great success in Scandinavia, a market formed by 
Denmark, Finland, Norway and Sweden. A company called NordPool is granted by the 
respective governments the exclusive right to buy and sell electricity over the borders of 
the participating countries. 

Applying this idea to our example amounts to set the initial prices in the French 
and Italian markets at p° and p 2 , the respective autarkic levels. Then the transmission 
monopoly starts quoting a larger price in France (standing as a buyer of electricity) and 
a lower one in Italy (standing as a seller); he will continue until the point where the 
transmission link is fully used but not congested. The price difference between the two 
regions p\-p\ expresses the opportunity cost of using the link, hence electricity is priced 
at its "true" marginal cost in each region. At the equilibrium, the transporter buys T 
units at price pi in France and sells them at price p 2 in Italy thereby making a windfall 
profit. 

The second method amounts to tax trades that cross the border. Ideally the trans- 
mission operators compute the efficient Pigouvian tax t* - p* - p 2 in order that arbitrage 
makes the full price equal on both sides of the Alps. In both cases, the transmission 
monopoly obtains a rent that has to be redistributed to market players or invested to 
improve transmission capacity and reliability. 

Regarding fairness and price discrimination, the model's result tells us that con- 
sumers of energy located far away from production centers (or import harbors) ought 
to pay more for it since they generate a higher cost of effectively providing the service 
(transportation cost). In this sense, cities subsidy the countryside whenever a uniform 
statewide consumer price is imposed by the regulator. 



25.4 Road Congestion 

25.4.1 Origin 

Stated in few words, road congestion occurs in the urban areas of all countries because 
geographical space is limited while urban population keeps growing. What appeared half 
a century ago in the largest cities of the most advanced economies has spread over the 
world and is starting to be thoroughly studied now that government have come to realize 
the magnitude of the associated costs (cf Textbook on Transportation Economics). 

The root of the problem is the organization of production in modern economies which 
requires not only task specialization but also geographical specialization. We thus ob- 
serve the creation of a central business district (CBD) in each city, generally at the center 
or at least within a pre-existing urban area; this centric localization means that extend- 
ing the CBD is costly because transaction cost to displace neighbors are significant. For 
that reason, the CBD tends to grow vertically with the appearance of skyscrapers. On 
the other hand, residential areas in cities can grow easily (or cheaply) by extending to- 
wards the outskirts. 17 ® 

Now, the growth of population generates a growth in commuting while economic 
growth enables more people to afford a car and use it for commuting (instead of us- 
ing public transportation). These two effects compound to create an important growth 
of traffic on the arteries linking residential suburbs to the CBD. In most cases, urban 
planning has failed to anticipate this evolution and as a result, road capacity is system- 
atically inferior to the peak demand when commuters drive to and from the CBD. The 
resulting congestion is recurrent and is refereed to as "bottleneck". Other forms of (non 
recurring) congestion includes accidents, special events or works. 

Although the issue of road congestion now ranks high on the agenda of public de- 
cision makers, data has only recently begun to be collected in order to enable well in- 
formed choices. For the US, Schrank and Lomax (2005) show that mobility problems 
in large urban areas have increased at a relatively constant rate during the last two 
decades. Congestion affects more cities (not only the largest ones), more roads in each 
city, the time span where congestion can appear is greater (not limited anymore to morn- 
ing and evening rush hours) and the duration of congestion events is greater. This can 
be checked in Table 25.4 where data for the 85 largest US urban areas are gathered. 18 ® 
Travels, both private and public, have increased at a faster rate than population growth, 
thereby generating more congestion in terms of extension, intensity and duration which 
altogether contribute to a staggering increase of the total cost; the latter which only 
sums fuel and time wasted thereby ignoring all negative externalities already amounts 
to one percent of the income of the involved population. 



Year 


1982 


2003 


Yearly A 


Annual bn of vehicle-miles (roads) 


387 


781 


3.4% 


Annual bn of person-miles (public transport) 


23 


43 


3.0% 


Proportion of roads affected 


35% 


65% 


3.0% 


Congested traffic during peak time 


33% 


67% 


3.4% 


Daily hours of congestion 


4.5 


7 


2.1% 


Delay in hours/year/peak traveler 


16 


47 


5.3% 


Average delay over free-flow 


12% 


37% 


5.5% 


Reliability delay over free-flow 


60% 


100% 


2.5% 


Cost in 2003 bn$ 


12 


63 


8.2% 


Cost in proportion of income 


.4% 


1% 


4.5% 



Table 25.4: Road congestion in the US 



25.4.2 Solutions 



Since the major cause of road congestion is the inadequacy of supply capacity to meet 
demand, aka bottlenecks, 19 ® the first and obvious answer is to extend transportation 
systems, both privates ones (road) and public ones (subway, train). Yet the pace of new 
constructions is systematically inferior to demand growth because of transaction costs 
for expropriation and the price of land nearby the CBD (cf §16.1.2). This applies for 
roads as well as public transportation systems. 

A symmetric solution to the aforementioned, advocated by Vickrey (1963), is to cur- 
tail demand using either prices or quantities (rationing). The difficulty with applying 
peak load pricing to road usage at the local level stems from two factors. Firstly, most 
urban ways are free, their cost being paid by taxes; citizens thus feel the differential 
pricing scheme as a tax in disguise. Secondly, the benefits are hard to pinpoint since 
they consist of an average reduction of lost time; indeed, not all the variations in traffic 
are eliminated, only smoothed. Finally, this efficiency enhancing scheme is believed to 
be un-equitable (socially regressive) because the poorest users are frequently those with 
a high cost of arbitrage; this means that their pattern of behavior is not affected by the 
scheme, they only end up paying for what was previously free. 

For the case of road traffic, some cities like Paris have adopted a deliberate policy 
of rationing the supply below the level afforded by the physical capacity. Car use is 
made costlier and more time consuming, by limiting parking space, reducing the number 
of streets available for transit 20 ® and generally speaking increasing car related taxes. 
Such a strategy has mostly cost because traffic worsens and few benefits in terms of 
better functioning of alternative transport modes (bus, subway, bicycle) as shown by 
Prud'homme et al. (2006) in their study of the Paris strategy. 21 ® 

The alternative strategy based on prices is to charge for access to the conflicting zones 



during peak hours; this is one occurrence of peak load pricing adopted by many cities 
starting with Singapore in 1975. In the London case, TfL (2006) reports an 18% traf- 
fic reduction in the charging zone leading to a 30% reduction of the congestion delay; 
at the same time, bus use increased as well as quality of service. Lastly, some posi- 
tive externalities were notices in the form of lesser road traffic accidents and 12% less 
emissions of pollutants. Yet TfL (2008) reports that the substitution of car lanes into 
walking or biking lanes and the important works on the streets have severely curtailed 
the road capacity for cars so that congestion has returned to the 2002 level. Computa- 
tions found with the introduction of congestion charging, an elasticity of -.45. Yet, when 
the 60% raise took place a few years later, elasticity dropped to -.2. While the initial 
scheme seemed to deter some drivers from accessing central london by car, those who 
paid seemed ready to endure any increase to continue to enjoy the privilege. The report 
states that "the extension of the charging zone to the west has shown none or little im- 
provement in all the dimensions previously commented indicating that a simple and not 
overtly expensive price instrument cannot solve for all the traffic problems of a city". 

Lastly, we may look at the cost of implementing the congestion charging system which 
uses an extensive network of cameras. Prud'homme and Bocarejo (2005) find it to be 
quite large and argue that the net benefit to society is not necessarily positive but will 
surely become so once the aforementioned cost is reduced by learning and experience. 

25.4.3 Expansion 

Road Pricing 

Hau (1992) recants the origin of the debate on road pricing. Pigou (1920) observes that 
the open-access regime of many natural resources (land, sea) or infrastructures (road) 
leads to an inefficient over-exploitation. He uses the example of two roads, a fast but 
narrow one (A) and a wide but slow one (B). As traffic grows on A, travel time increases 
up the point where the advantage over route B vanishes. A toll equal to marginal cost 
minus average cost of a trip would bring traffic on route A to the efficient level i.e., the 
negative traffic externality would be internalized. Knight (1924) agrees with the exis- 
tence of an externality but sees public intervention warranted only for public ownership. 
If road property rights are well delineated and competitive pressure is present, a private 
owner derives a Ricardian rent and faces the same incentives as the social planner when 
seeking to maximize it; he thus sets the same toll. 22@ 

Mohring and Harwitz (1962) study the management of a congestion prone road as 
follows: the perceived unit cost of a trip is C{q,K) = v x r(qlK) where v is time value, q 
total output (e.g., flow on road), K capacity and t the convex cost of delay or congestion. 



The capacity cost (including building AND maintaining) 23 @ is ®(X) (e.g., rK). Demand 
arises from the elastic WTP of potential users P{q) and equilibrium settles when P{q) = 
C{q,K) + p where p is service price. Welfare is W{q,K) = U{q) - qC{q,K) -®(K) where U = 
Jq P{x) dx is the gross surplus from road use. Its maximization involves a congestion term 
since the output FOC is P = C + qC q o p = qC q i.e., a Pigouvian tax is required to edge the 
congestion term. Its effect is to depress demand down to the point where WTP is equal 
to social cost. This is symmetrical to the monopoly increasing its price to edge out the 
profit loss due to competitive demand (to be improved, ref needed). Toll revenue is then 
R = qp = q 2 Cq. The FOC for efficient capacity is O m = -qC^ => K® m = -qKCK = R because C 
is a function of the ratio -| (ie.., satisfies qC q = -KCk)- In the end, the ratio of revenue to 
cost of the asset is ^ = k, the elasticity of capital cost O. Thus, toll revenue exactly covers 
cost if and only if the road construction and maintenance technology displays constant 
returns to scale. 

A monopolist owner (or manager) would maximize profit n = pq-<${K) which amounts 
here to replace WTP P by the smaller marginal revenue P + qP' in the quantity FOC 
and maintain the same capacity FOC. Thus a larger price would be asked in order to 
sell a higher quality service at a premium. Capacity in turn would likely be smaller. 
As noted in footnote 26.3 above, if there exists an alternative road at fixed toll (zero if 
open access) and an unlimited demand for usage, then the monopolist is faced with an 
infinitely elastic demand and forced to behave as a social planner. Yet, such an extreme 
assumption is unlikely to hold. 

It is been shown empirically for highways, the existence of mild scale economies (e.g., 
doubling lanes), meaning that k is close but lesser than unity. Hence, first-best efficient 
pricing which is a positive toll to limit congestion would still run a deficit. As with 
most utilities, a second best pricing is required to allow the asset to stand on its own 
bottom. 24 ® The conclusion is reversed in urban areas because the cost of land tends to 
increase exponentially (it is a very scarce resource) and interchanges (bridges,...) are 
more numerous. Hence first-best pricing would generate a surplus. 

Paradoxes 

Downs (1962) formalizes Pigou (1920) and Knight (1924)'s verbal argument to produce 
well known capacity expansion paradoxes bearing their names. 

City residents commute by car to the CBD using a network consisting of two roads, 
one narrow and fast (A), the alternative being wide and slow (B). The demand for com- 
muting service is inelastically set to unity. Travel time on route i = A, B depends on the 
share of total traffic dt going through this route. We assume that the wide road is never 
congested so that tsidB) = t. Free flow travel time on road A is ^(d^) = f < t meanwhile the 



density remains below capacity i.e., d.A ^ k. For higher densitities, travel time increases 
linearly with TaWa) = t + ad^k. The left panel of Figure 25.5 uses d = d.A = 1 - dg on 
the horizontal axis to display f^ and fg simultaneously. We consider increasing capacity 
levels ki,k 2 ,k3 to illustrate the effects of increasing the capacity of the congestion-prone 
road. 

The equilibrium distribution of travelers is characterized by the no-arbitrage condi- 
tion: tA = ts = i and is shown with a circle on the Figure. It is readily seen that an 
increase in capacity from k\ to k 2 , increases equilibrium density on road A from d\ to 
d 2 without changing the common travel time t = t on both routes. The latent demand 
for the better road instantaneously fills any capacity melioration. If the capacity is in- 
creased enough to fc 3 then nobody uses the slower road anymore (d = 1) and travel time 
is reduced. 




Figure 25.5: Road Paradoxes 



The paradoxical result that a capacity increase might be completely ineffective is a 
consequence of the private behavior of commuters who ignore the cost they impose to 
others when switching to the fast road. The comparative statics between k\ and k 2 tells 
us the following: 

A route can be summarized by two characteristics, capacity (ampleness) and quality 
(speed or travel time). When the alternative to a narrow and fast route is a wide and 
slow one, it is advisable to improve quality before capacity i.e., make the slow route 
faster rather than widening the narrow one. 

The efficient use of the road network calls for minimizing total travel time T = dtA + 
(1 - d)tB or average travel time. 25@ In the open access equilibrium where both routes are 
used, T = t, which graphically is the area of the rectangle delimited by the axes and t. 
To internalize the negative traffic externality, one should first limit transit on the fast 
road A to its capacity (dA = k) so that a share k of demand enjoys a shorter travel time. 
Graphically, the savings are the striped area on the left panel of Figure 25.5. Obviously, 



additional capacity increases this potential time saving. Implementation of this scheme 
can bring more or less savings. If traffic lights at the entrance are used, potential users 
will be served on a first come basis which does imperfectly reflect their WTP for the fast 
road. It is thus preferable to charge for road access, the solution adopted for modern 
time-saving highways in many cities, at a level that will on average equate usage to 
capacity. If the metering technology is not too costly, it is even better to auction permits 
and have permit holders install a wireless transmitter to access the road faster. 

Mohring (1972) makes the previous paradox even more striking by considering mass 
transit as the alternative to the congestion prone route. Since the train or bus operator 
waits for a train to be full to let it circulate (in order to maximize revenue at given cost), 
an increase in demand is met by increasing the frequency of trains or buses. There is 
thus a positive externality in consumption since the waiting time of users will be reduced. 
If we let the maximum travel time on route B be t (minimum train frequency), then the 
general formula is ts - i- /3dB which is displayed on the right panel of Figure 25.5 (it is 
increasing because the horizontal axis shows d - 1 - rfg). 

The equilibrium distribution between the two transportation modes continues to be 
driven by the no-arbitrage condition. When road capacity is increased from k\ to Ic2, 
car travel time drops, thus some people arbitrate and leave mass transit so that train 
frequency worsens and even more people switch to use their car. In equilibrium of this 
adjustment process, road patronage increases, as before and, but travel time t also in- 
creases from t\ to tz. 

The paradoxes arise because there exist alternatives to the congestion prone service; 
the distribution of users between the available options then obey to no-arbitrage condi- 
tions which can trigger perverse effects. 

25.4.4 Cost Benefit Analysis f 

Data 

Prud'homme and Sun (2000) assess the cost of congestion on the most used road in 
France, the Paris freeway ring (boulevard peripherique) using detailed data from 1996. 
The bottom of Figure 25.6, with inverted axis, displays the frequency distribution of 
speed across the day. We observe that free flowing traffic is the most frequent situation 
(55% of the minutes). At that maximum speed, slightly above the 80km/h legal limit, 
there is one car every 100m on each lane or one car passing trhough every 4 seconds. 
Another quarter of the day is congestion free although travel speeds are reduced, finally 
one fifth of the time is problematic (speed lesser then 50km/h). 

The output of the transit system is measured by the flow (right axis, heavy line), the 
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Figure 25.6: Distribution of speed on the Paris freeway ring 



number of vehicles able to pass between two exits every hour; it is, at every moment, the 
product of speed (km/h) by density (vehicles/km) respectively displayed on the horizontal 
and left axis. There is congestion when the flow is forced below its maximum because 
the capacity of the system has been reached. Let us explain how this happens. 

When more vehicles enter the freeway, starting around 6h in the morning, density in- 
creases in the sense that every vehicle comes closer to its predecessor; average speed thus 
starts to decrease because drivers adopt a cautious attitude. The relationship between 
the two magnitudes is displayed as the light curve and is almost linear i.e., putting 1 
more car per lane on a one kilometer section reduces speed by roughly lkm/h. This 
phenomenon is probably best explained by the desire of every driver to maintain approx- 
imatively a 20m distance with the preceding vehicle. Yet the influx of new cars on the 
ring increases the flow steadily because traffic is nevertheless quite fluid. The maximum 
flow is reached for an approximate speed of 50km/h when the time interval between two 
cars falls down to 1.3 second which is the minimum acceptable by an average driver for 
his/her security 26 ® From that moment on, a 1% increase of density due to the entry of 
new cars, imposes a strong negative speed externality on all current users i.e., a speed 
reduction greater than 1%. Since flow is the product of speed by density, it must decrease 
which means that congestion is now present on the system. 



Theory 

The relation between speed s and density q is estimated by the linear regression s = 
s- aq where s is the maximum speed on the road (it ought to be the legal limit) and 
a is the rate of speed decrease with density increase. Since the flow is / = s x q, its 

- -2 

maximum is reached for q = ^-, leading to a maximum flow of / = £j per lane. For the 
Paris experiment, s = 94km/h and a = 0.84 leading to / = 2650 vehicles per hour per lane 
(the figure on the graph is trice because the Paris ring has tree lanes). 

For a given hour of the day the average user has a demand for the transit service 
increasing with speed or decreasing with density (since the two variables are inversely 
related); in other words his WTP 27@ for using the road at that moment is a decreasing 
function P of density q. The per km cost of using the road is the capital cost of his car 
r (including wear and tear) plus the fuel and time cost which are inversely proportional 
to speed, hence the (individual) per km cost is c/(g) = r + j = r + jS^- The social cost per 
km sums the cost of all users present at the moment, it is thus C s {q) = qctiq). The social 
value of using one km of road sums the WTP of all the current users, i.e., V{q) = tf P{x) dx. 
Welfare being W{q) = V{q)-C s {q), it is maximum at q* when V = C s o P{q) = ct(q)+qc i {q) = 
c s {q) which is the social cost per km of adding one more user. Since the individual cost 
Ci is increasing with density, the social unit cost is greater than the individual one i.e., 
c s > c t . 

It is now clear that the road is subject to the tragedy of the commons; in a free ac- 
cess equilibrium, each potential user compares his individual cost at the expected level 
of traffic to his own WTP ignoring that his decision to join will worsen traffic conditions 
for all. A commuter uses his car if P > cf, such a behavior leads to an open access equi- 
librium q° characterized by the equality P{.) = c ( (.); we thus observe on Figure 25.7 an 
excessive density. The cost of congestion is then W{q*) - W{q°) and since welfare is the 
area between the demand curve and the social cost curve, the loss due to the change from 
q* to q° is the gray area on Figure 25.7. 

Notice that the social cost computed at the inefficient density p = c s {q°) exaggerates 
the efficient price p* which is used to determine the efficient density. This observation 
can be refined as follows. By writing W{q) = V{q) - C s {q) = f^ P{x) dx - qct(q), we see that 
welfare is the area between the demand curve and the price, hence W{q*) = V{q*) - q* p 
and W{q°) = V{q°) - q°p°. The welfare loss in the open access equilibrium has thus a 
second geometrical representation; it is the striped rectangle minus the striped triangle. 
An upper bound is then q* {p° - p) which is considerably smaller than the very simplistic 
estimate q°{p-p°) computed using the observed data; the latter would be correct only 
if the demand for road use was extremely inelastic (the demand curve becomes vertical) 
which is not the case since empirical studies conclude to a value between -1 and -0.6. 



Lastly, Figure 25.7 displays as well as an off-peak demand D/ to recall us that the issue 
of congestion cost makes sense at peak time only. 




Figure 25.7: Congestion Cost Estimation 



Using the above experimental data, the efficient density and flow can be computed 
for all classes of demand. The resulting curves are shown in dashed on Figure 25.6. In 
the open access equilibrium, the cost of congestion (welfare loss) represents 9.2% of the 
total cost of using the freeway but 17.2% of the current welfare. The aforementioned 
estimation error would lead to a figure three times larger than the true congestion cost. 
The efficient situation is simply characterized by having around 20% less cars using the 
freeway as soon as speed is less than 60km/h (on average density is reduced by 14%). This 
would guarantee that speed never falls below 14 km/h and one would achieve a 3% traffic 
increase, a 6% speed increase, a 16% time saving and a 9% cost reduction. If we now 
add the external cost of pollution at 36€ per thousand vehicle km, then efficient levels 
of road use are reduced by a much larger 57% with a heavy contribution (i.e., reduction) 
at high speeds (and low demand). This might seem incoherent but one must remember 
that society values very much using the ring at moments of peak demand, thus density 
is optimally reduced by a factor one third only. On average, accounting of pollution leads 
to reduce traffic by 35% together with a 48% reduction of the time spend on the system 
and a 44%reduction of expenditure. 
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Chapter 26 
Appendix 



26.1 Miscellanies 



26.1.1 Returns to Scale 

Turgot (1844) explains how labour L (variable input) used to till a plot of land (fixed 
input) would initially be more and more productive; graphically, the slope of O which is 
labour productivity ®l increases. This process does not continue ad-infinitum because 
the land becomes exhausted; labour productivity then starts to decrease which consti- 
tutes the "law of decreasing productivity". Notice the difference between this concept 
involving partial differentiation and returns to scale which involve total differentiation. 

A° _^- — A^l 

q 





In fact, we never observe the increasing productivity part of ®i because the previous 
curve is constructed for a small plot of land e.g., an are (100 square meters). The farmer 
then optimizes his total effort as L = nL* where n is the number of ares he decides to put 
for cultivation. If he owns a small number of ares n such that his total time is L > nL* 
then his apparent marginal productivity of labour will be constant up to nL* and then 
decreasing. Obviously, plots are never of homogeneous qualities and the best ones (those 
with highest q*) are used first so that visible productivity (across a region) is always 
decreasing. 



26.1.2 Constant Elasticity of Substitution 

Under CRS, if we denote x= f then Q = ®(iC,L) = LO(x, 1) so that defining f{x) = 0(x, 1), 
the production equation reads y = f(x). The marginal productivities of capital and labour 
are ® K = Lf§ K - = f and O l = / + L/'|f = / - Lf§ = f - xf. The elasticity of substitution a 
of K into L, is the elasticity of \ with respect to the MRSk/l - |r along an isoquant i.e., 

1 dMRS KlL KIL xf d (f-xf 



a dK/L MRSk/l f-xfdx\ f 

xf -xf'f"-f"(f-xf) -xff" 



f-xf f 2 f(f-xf) 

Since in equilibrium the relative wage w equates the labour productivity we have 
w = ®l = f- xf, thus ^j = -xf" so that using ^ = /' we discover that 

dy w dy dx w fif-xf) 

b= = = = a 

dw y dxdwy -xff" 

Let us now characterise the production functions satisfying exactly (2.10). Using 
w = f- xf, we can write 



lny = lna + b\n[y - xy') o y llh = a llb {y - xy') 

a—y 

xla 

Now, using the fact that y' - -^, we have 



, a 1,b y-y 1/b yla-y 1,b y(l-ayP) .,, _i/f, .., n 

° /= Jibi = xia = x with a = a Ub ,p = l/b-l 



dx dy dy ay p l dy , , 1 , , n . 1 , n 

— = — — = -?- + — o In x = In y - - In [ 1 - ay p + - In B 

x y{l-ayP) y 1-ayP P P 

for some integration constant /3. Taking exponential, we obtain 

-i v p x p 



x = ppy{l-ay p ) p o x p = fi— o/ = 



1-ayP p + ax p 

y = x{/3 + ax p ) ~p = {/3x~ p + a)~p 
Q = L {/3K- p L p + a) f = [pK~ p + aL~ p ) f = y {6K~ P + (1 - 8)L~ P ) 



with y = {a + p)-p o a + (5 = Y p and 8 = /3y p . Since the elasticity of substitution is b = j±-, p 
is called the substitution parameter, y is the efficiency parameter (that can be normalized 
to unity for a single firm) and 8 is the distribution parameter. 



26.1.3 Rubinstein's bargaining model 

There are two extreme asymmetric Nash equilibria whereby one player, either S or N 
stubbornly asks the whole prize and refuses anything different ad infinitum; in equilib- 
rium, he obtains his demand. This is an unrealistic outcome because part of the prize is 
lost once the first offer has been refused. We thus look for a subgame perfect equilibrium 
(SPE). 

Consider a SPE where players keep repeating the same offers (stationarity) and ac- 
cept the equilibrium offer immediately (no delay). We do not know yet if it exists but we 
shall pinpoint it. For i = S,N, let St and q\ be the discount factor and equilibrium offer of 
player i. The present value for player i of rejecting qj is Siqt since he will be able to make 
his equilibrium offer at the next round and it will be accepted. As we are assuming that 
rejection is dominated, it must be the case that 1 - qj > 8iqi. However, if the inequality 
was strict, then player j could ask more for himself and still see his offer accepted. From 
the system of equation so obtained we deduce q* = x _ s l . 

1 J 

We now exhibit a strategy profile that form a no delay stationary SPE: player #i offers 
systematically q* and accept anything lesser than 1 -Stq*. When player j follows this 
strategy, player i gains q* by offering q* (which is accepted) whereas he earns less if he 
deviates. If player i is at the point of receiving the offer q* it is trivial to see that he 
better accepts it. 

Lastly, it remains to be shown that there are no other SPEs. To avoid clutter we 
consider a common discount factor. Let p be the greatest payoff the offerer, say S, gets 
in any SPE starting at t = 2. Then at t = I, in any SPE, S must accept the offer 8p 
by N. Hence, at t = 1, un ^ 1 - 8p in any SPE. But then at t = 0, un > 5(1 - 8p) so that 
u$< l-8{l-8p). Since the subgames at f = and f = 2 are the same, p = 8{l-8p) => p< ^§- 
The same reasoning with the lowest payoff q yields q > ^. Since q < p by construction, 
there is a unique SPE payoff q = p = ^o. 

26.1.4 Coase Conjecture 

Result 1 When limited by a capacity constraint, an inter-temporal price discriminating 
monopolist cannot earn less than 30% of the rental monopoly profit. 

A durable-good monopolist sells to a continuum of consumers, each demanding a sin- 
gle unit. The WTP of individual with index q e [0,1] is v{q) e [v;v] with v > and v' < 0. 1@ 
The monopolist's revenue function, R{q) = qv{q) is assumed to reach its maximum R* at 
some q* > 0. Both consumers and the monopolist live forever and discount the future at 
the rate r. Sales can occur every t minutes. The discount rate per period is thus 8 = e~ rT . 

Let T be the last period of sales. Observe first that the price listing [p t )t<T must be 



decreasing because no one would wait to buy dearer. Next, it must be the case that pr = v 
i.e., the last consumer buys the good. Indeed, if we had pr > v, then some people would 
be left unserved which is irrational for the firm as she can extend sales for one additional 
period and make an additional profit. That pj<v_ cannot be true is also obvious because 
it would amount to make undue rebates to the last buyers. Our third observation is 
that the marginal buyer at time t must be indifferent between buying now and later if 
he derives the same net present utility. Denoting v t the WTP of such a person, it must 
satisfy v t - pt = 8{v t -pt+\) => Pt = (1-5) v t + 8p t +\, hence 

p t = 6 T - t v + (l-6) £ 8h t+j (26.1) 

7=0 

From our three observations, we deduce that the only strategic choice for the monop- 
olist is to choose the duration of sales T. A crucial feature of the Coase conjecture is 
the ability to sell any amount at any time. Such an untenable assumption is debunked 
by assuming that the firm must choose (at no cost) an instantaneous service capacity K 
before the start of sales; it enables to sell up to Kt units in every period. Since sales take 
place during T periods and the entire market of unit size is served, we have TKt = 1 so 
that there is only one variable to choose, either T or K. 

The monopolist equilibrium behavior (capacity choice and price listing) is hard to 
characterize, thus we study her profit along the sub-optimal behavior consisting in sell- 
ing exactly at capacity in each period i.e., q t = Kt = ^. From this simple accounting 
statement, we deduce the WTP of the marginal buyer in each period: v t = v{tlT). The 
profit along the sup-optimal path (security option) is 

T i _X T T-l-t -, _* r-i 

f=l 1 t=l j=0 1 f=l 

as the double summation simplifies. Let s = q*Tbe the time necessary to sell the mono- 
poly quantity at which point v s =v{q*). Since T > s and v t is decreasing, we can limit the 
summation index to s instead of T and substitute v s for v t . We thus obtain 

% > y to — R = ft 

T £1 s{l-5) z 

using R* = q* v s = ^ and introducing z = s[l - 5). By l'Hospitals rule, p=j^ = 1 _ r e T ,. T — ► 1, 

hence 8 s = e~ rTS = e~^s - e~ z P — ► e~ z . Thus, at the limit where the time interval r vanishes 

(and 8 -^ 1), 

fc l-e~ z {l + z) 

— > y = 

R* ,z z 



whose maximum is 7* = 0.3 for z* = 1.8. 2@ Lastly, we can compare the capacity corre 

z* 

a-8) 



sponding to this minimum profit with the monopoly sales. 3@ As q* T* = s* = ttzx), we have 



K* _ 1 _ 1-8 _ r r 



.,.; - - ,,.. - .56r. Given that the subjective interest rate r is typically 

L4 (lit L/C L) Aj -, f\ Aj 

less than 10%, the service capacity is less than 5% of the rental monopoly sales. 

Continuous time One can check that up to 1-5 which vanishes to zero with t, n = 
T,J=i[vt+i^r- Vt-f)?* 1 i- e -> the discounted value of marginal revenue. This formula ex- 
emplifies the sources of profit loss for the monopolist. Firstly, because she can't commit 
to stop selling at any point in time, she ends up selling too much i.e., beyond s to peo- 
ple with R m < 0. Secondly, profitable clients (those with WTP greater than v s ) only buy 
gradually since K* , the per period sales, is a tiny fraction of the straight monopoly sales 
q*. 

In equilibrium, buyers are ordered thus buyer Kt buys at time t paying price p t . 
Since this is an optimal behavior, his utility u{Kt) = e~ rt {p t - v{Ki)) maximizes the utility 
from buying at an alternative date s which is e~ rs {p t - v[Ks)), hence, taking derivative, 
u'(Kt) = e~ rt v'(Kt) must hold. Then using p T = ^ => u{KT) = 0, we integrate to get u{Kt) = 
-Jj e~ rs v\Ks)Kds = f^ t e~ rx,K v'(x)dx = e~ rt p t - f^ t je' rx,K v(x)dx (integrating by parts). 
Given the definition of utility, we derive p t = e rt f^ae ~ ax v{x)dx with a= £. 

Now, firm profit is 

pT r-q r-q r-q 

7i= j e rt p t Kdt= j axe ax v{x)dx= j R m {x)e ax dx= j aR{x)e ax dx 
Jo Jo Jo Jo 

using integration by parts in the second and last equalities (with R{q) = R(0) = 0). Now, n > 
ff fR{q*)e~ ax dx since v{x) = R{x)lx is decreasing. Thus ^ > ff fe~ ax dx = 1 ~ e "^! +fl ' 7 * 
7 which can be maximized in z = aq* at z - 1.79 yielding n > 0.3R{q*). 

26.1.5 Oligopoly 

Innovation in the Cournot Oligopoly 

Extending (12.5) to oligopoly using the Cournot oligopoly equilibrium quantity (5.14), we 

obtain 

a-b{nci -(/i- \)c-i) a-bc + b{nxt -{n- 1)jc_j) 

qt = 

n+l n+l 

where c_ ; (resp. X-i) is the average marginal cost (resp. R&D level) of firm i's opponents. 

The FOC of optimal R&D is thus -^qt = j£ = |f- = 2Ax ; -. In a symmetric equilibrium, 

xi = x-i = x, so that the FOC becomes n{a -bc + bx) = [n + I) 2 Ax leading to the symmetric 

equilibrium level x = J^+\?--bn ' ^ e ensuin g equilibrium quantity is then q = j5J£$£^ 



with profit ft = \q 2 - Xx 2 = AjK 2 ( A(n+ 2 1 ^ 2fcw2 ). We obviously need to assume A > j^ to 
avoid the trivial zero R&D corner equilibrium. 

Regarding comparative statics, we note that investment, quantity and profit decrease 
with the quality of R&D (A) and the number of firms (n). Indeed, we obviously have ff < 
while algebra reveals that ||oc X(l-n 2 ) < 0, ^ oc b- A(n + 1) 2 < and g| oc -nb{n + l) <0 
(profit derivatives follow likewise). The effect of changes in a and b are as in the duopoly 
case. 

Innovation in a Price Oligopoly 

The original duopoly of Hotelling (1929) is extended by Raith (2003) to oligopoly using 
the circular city model seen in §11.1.3. His conclusions are achieved using management 
incentives of the form given in §20.2 and more specifically equation (20.5) (his equation 
(A3)). Our identification of R&D investment with many types of innovative activities 
thus readily generalize his findings. 

When the n active equidistant firms have potentially different cost q, equation (11.10) 
characterizing the best reply of firm i is \ {-pt-\ + 4p t - pt+i) = zi where z t = ~ + c ; -. In 
matrix form, it reads Ax p = z where the entries of line i of A are an = 2, a^-i = a^+i = -\ 
and aij = for j ' ^ i - 1, i - i + 1. Matrix A is circulant i.e., each row takes the previous one 
and shifts it one place to the right. The inverse exists and is also circulant; its first row 
has coefficients -^ Ig±vg!+|±yg^ for A: = 1 to n. 

Since A is invertible, the price equilibrium is p = A~ l z and we can write 

Pi = -+yci + 6c-i (26.2) 

n 

where c-i is a weighted average of the cost of i's competitors. For instance, if n = 3, 
y=| and 6 = \ while for n > 5, y — -^ = 0.58 and 6 - 0.31. 4@ What is fundamental in 
this equation is the presence of 6 showing that the equilibrium price is sensitive to the 
industry cost structure, not just the own technology. 

Let us plug this result into profit to investigate the optimal cost reduction. Profit is 
m = ipt - a)Di, thus §| = [y-l)Di + { Pi - a) 9 -^- by (26.2). To compute g, observe that 

m (2t \ m [It „ ^ 

A- = - - + Pf+i + Vi-\ -2pi\ = -\ — + (d- 2y)a + ... (26.3) 

2t V n j 2t \n ) 

as the coefficient of c ; - in either p i+ i or p^\ is 812 (cf. footnote 26.3). We thus get -J^- = 
(5/2 - y) y and -^- - (7- l)Dj + [5l2-y) I j[pi - a). In a symmetric investment equilibrium, 
firms have identical costs thus the price equilibrium is also symmetrical so that D t = ^ 



and pi = | + c - x, hence 

— L = {y - i) _ + (5/2 -7) = (5/2 - 1) — a -0.85— (26.4) 

i.e., the strategic effect of R&D investment over price shrinks the direct effect ^ by a 
minimum of 15% thus proving the claim made in the text. Notice though that Raith 
(2003), in his equation #2 uses the FOC instead of the equilibrium formula (26.2) i.e., 
sets 7 =\ and 6 = and thus fails to identify the strategic effect of competition upon 
investment incentives. 

The FOC for R&D investment at a symmetric equilibrium is thus 0.85^ = Ax => x = 
0.85 j^. Profits are then it = ^[t- °- 8 2 5 ;i m -F where F is the fixed cost of operation. If the 
transportation cost falls (t\) or products become less differentiated or entry cost rise 
(F /), profit falls thus generates exit (n \) which in turn boosts the sales and innovation 
of the remaining active firms. A market size expansion (m /) boosts profits but at a 
less than proportional rate, thus entry occurs with an elasticity lesser than unity so that 
innovation, being driven by the ratio of m over n, increases (although there are new 
firms, old ones still manage to produce more which spurs innovation). 

State of Nature 

Bush and Mayer (1974) consider the original state of nature, anarchy, where private 
property does not exist and agents can invest some of their scarce resources to rob oth- 
ers. Since initial income is exogenously given, this is not a fully fledged general equilib- 
rium model. Once the Nash equilibrium is characterized and shown to involve wasteful 
offensive and defensive investment, the authors look for the conditions enabling a jump 
out of anarchy towards a peaceful and orderly society without theft. 

Each person has a differential endowment fc ; of (capital) income and consumes k t = 
ki + c\i - q~i where qi is preying upon others while q-i is the average preying from oth- 
ers. Differentiated theft ability is measured by the maximum preying resource //. Let- 
ting /; = U - qi denote leisure and wt = h + U full income, the budget constraint reads 
Wi = ki + li + q-i i.e., income is divided between production, leisure and loss from theft. 
Assuming that people only care for consumption and leisure, utility maximization leads 
to demands that are increasing in the net income w \ - q~i. From Z ; - = <p t {Wi - q~i), we de- 
duce the appropriation best reply qi = li-(p t {Wi - q-i) to the effort q-i of others. A Nash 
equilibrium in this context is called a natural equilibrium. If (p^wO < li holds true then 
the equilibrium features wasteful appropriative activities. This condition states that in a 
world of non wasteful people, one would want to spend part of his/her time to appropriate 
others' wealth. 



In the Cobb-Douglas case, u{k,l) = k a l l ~ a , thus (p[w) = {l-a)w so that the previous 
condition reduces to (1 - a)/// < alk t i.e., the marginal utility of leisure is lesser than that 
of investment (at the respective endowments). This is likely to hold true for those with 
little capital or very productive at appropriation or favoring consumer goods over leisure. 
The derivation of the equilibrium follows the Cournot method of §5.1.3. The best reply 
equation simplifies into qi = alt - (1 - a) (kt - q-i). Summing over all participants yield 
q = al-{l-a)[k- <?) => q = 1- ^-^k where ? denotes average. Plugging q- t = nq n ~^ 1 into the 
best reply equation enables to solve for individual levels. 

26.2 Risk and Uncertainty 

26.2.1 Time Discounting 

Result 2 The present value of 1€ to be paid at time t with interest rate r is e~ rt . 

Consider an asset whose value at time t is v{t). The percentage increase of value 
over a period of length r is vit+ ^ v[t] , thus the interest rate over this period is v[t+ T r J^ t] vW ■ 
Letting r tend to zero, we obtain the instantaneous interest rate at time t as r{t) = dvl v dt . 
Integrating this equation yields v[t) - v{0)e^ rWdT . If we consider now a bond paying a 
fixed amount per period of time, then r (t) is a constant r so that the return of 1€ deposited 
for a length of time t is exactly v{i) = e rt . The present value of 1€ to be paid within time 
t is thus e~ rt . 

26.2.2 Euler Equation 

Result 3 Solution ofmax x f£° v{K t ,I t )e~ rt dt under the restriction K =Kand Kt = K. 

We use Lagrange's calculus of variations to solve max x f Q f{t,x,x)dt s.t. x = x,xt = x 
where x is a function of time, x its time derivative, T < +oo the horizon and f{t,x t ,x t ) the 
differentiable objective. Assume an optimal path x* exists and consider y s.t. yo = and 
yr = so that the distorted path x* + ey is still admissible for any e. The optimality of x* 
means that J{e) =max x / r /(r,x* +ey,x* +ey)dt reaches a maximum at thus 5@ 

= /'(0) = fvyf x (t,x*,x*)dt + f^ yf x {t,x\x*)dt 
= Jo[fx-fx)y&t + f XT y T -f Xo yo 

using integration by parts. By construction of y, the last two terms are nil, thus the 
integral is nil. As this is true whatever the shape of y, the parenthesis must also be 
nil and this constitutes the Euler equation 6@ -^ - jU^)- The transversality condition 



xrfx T = evaluated at an optimal solution is necessary, hence when there is no reason 
to impose the terminal condition xj = Jc (aka free boundary), we still need fx T = 0. For 
economic applications, we have x = K, x = I and f[t,x,x) = e~ rt v{K,I), hence (18.3). 

26.2.3 Risk Aversion 

Result 4 Constant absolute risk aversion (CARA) utility function: u{m) = -e~ pm 

If p is a positive constant, the differential equation ^ = -p has solution lnu'(m) = 
a - pm where a is a constant. Next, check that u(m) = c - be~ pm for constants b > and 
c. Since the expected utility function is unique up to an affine transformation, the result 
follows. 

Result 5 Pratt (1964)'s Theorem that the following are equivalent: 

® Mister v is more risk averse than Mister u. 

® v is a concave transformation of u. 

® Mister v's ARA index is larger than Mister u's. 

® Mister v's risk premium is larger than Mister u's. 

• ®^@ By contradiction: if 0(z) = v{u ^{z)) is not everywhere concave, it is locally con- 
vex at some w. Let us choose gamble x indifferent to u i.e., E[u{w + x)] = u(w), then 
E[v{w + x)] = E[(f){u){w + x)] >(f)(E[u{w + x)]) = (p{u{w)) = v{ w), by Jensen's inequality, thereby 
contradicting ® since v ought to refuse this gamble. 

• ©^®: As 0' = p§=ry, we derive 0"oc v"u'- v'u" < 0o p v > p u . 

• (D— ®: For every fair gamble x, letting z = u{w + x), there exits a risk premium p u 
such that u[w- p u ) = E[z]. Likewise 3p v such that v{w- p v ) = E[v{w + x)] = E[v (u~ l (z))\ = 
E[(p{z))\ <(p(E[z\) by Jensen's inequality, hence w- p v < v~\(p) {E[z}) = u~ l {E[z\) = w- p u . 

• ®^®: consider the lottery z = p v + x and initial wealth wq = w - p u . We know by defini- 
tion of p u that agent u does not reject this lottery, yet E[v{wq + z)] =E[v(w + x)] = v{w-p v ) < 
v{w- p u ) = v(wo). If the lottery were slightly modified so that u rejects it, then v would 
also refuse it. 

Result 6 Proof that p' u <0o p' u < 0. 

For a fair gamble x, the risk premium p solves E[u[w + x)] = u{w-p). Differentiating 
yields E[i/(u/ + jc)] -u'{w-p) = -j^u'{w-p). Thus ^ <0 iff a consumer with utility v = u' 
accepts gamble x indifferent to u i.e., he is less risk averse which implies p u , = =^- < p u = 
z J L<>0> t£ = Pu(Pui-PJ after re-ordering. 

Result 7 Arrow's claims for sensible risk preferences. 



Axiom 1 : given a gamble y and wealth m, E[uiy+m)] > u{m) => E[uiy + m + e)\ > uim + e) 
for e > i.e., there is continued acceptance upon becoming richer. 

We must prove that this property is equivalent to DARA. Assume Al holds. Pick a 
fair gamble x, some wealth w and the associated risk premium yu. We have E[u{x+ w)] = 
uiw-ff) o E[u{y+ m)\ = u{m) for y = x + /i and m = w-jx. By Al, E[u{x+ w + e)] > u{m + e) for 
e > 0. Taking the Taylor expansion leads to E[u'(x + w)] > u'{m) i.e., Mister u' takes the 
gamble, thus is less averse than Mister u. As seen above in Result 6, this means that 
the utility function is DARA. 

Conversely, let DARA holds and consider an accepted gamble y at wealth m. Let 
x = y-E[y] and w = m + E[y], then seek the risk premium [i<E[y] such that E[u(m + y)] = 
E[uix + w)] = u{w-fX) > u{w-E[y\). Apply Result 6 to obtain E[w(y+ m + e)\ =E[u{x+ w + e)} = 
u{w + e - p) > u{w + e - jS) > u{w - E[y] + e) = u{m + e) since fi< [i. That is to say, the richer 
person keeps accepting the gamble. 

Axiom 2 : given a gamble y and wealth m, E[u(y + m)] > u(m) => E[u{Ay + Am)] > u(Am) for 
A < 1 i.e., there is continued acceptance upon scaling down gamble and wealth. 

To prove that this property is equivalent to IRRA, one uses the same differentiation 
method. 



Result 8 The objective riskiness measures are homogeneous, sub-additive, convex and 
are FOC and SOD monotonic. 

Homogeneity: obvious, for the solution of E[e~^ lz ] = 1 is z = A<p g and likewise the solution 

of E[log[l + Ag/z]] =0\sz = \(p g . 

Sub-additivity: For the de Finetti measure (p g , write r = cp g , r' = (p g , and observe the 

Since the exponential is convex, we have E 



decomposition ^- — 



r+r' 



+ 



r+r' r r+r' r 



g+g 

g r+r' 



^ 



e r 



r 

r+r' 



+ TTT7E 



e r' 



= 1, hence r + r' > R{g + g') (cf footnote 26.3). For the Bernoulli 



measure 0„, write r = <p , r' = <p s ,. The concavity of log yields 



log 1 + 



g + g 

r + r' 



r + r' 



log(l + f) 



+ 



r + r' 



logjl + ^ 



hence r + r'> R{g + g') (cf. footnote 26.3). 

Convexity: a corollary of Homogeneity and Sub-additivity. Indeed, if /(Ax) = A fix) and 
fix + y) < fix) + fiy), then /(Ax + (1 - X)y) < /(Ax) + /((l - X)y) = A fix) + (1 - A)/(y). 

FOD : We say that g > g' when H g i.) < H g >i.) with at least one strict inequality i.e., 
the former gamble puts more weight on large prizes. Result 10 proves that g > g' o 

FOD 

E[uig)] > E[uig')] for all strictly increasing u. For the de Finetti measure, we apply this 



to the decreasing u{x) = e~ ax . We obtain E[e" a ^] - 1 < E[e" a ^] - 1 for all a > 0. Since both 
are negative (cf. footnote 26.3), their zeroes are p g , > p g => q> g = l/p g < <p g , = II p g ,. For 
the Bernoulli measure, we use the increasing u{x) = log(l + xla) and proceed likewise to 
obtain cp g < <p h (cf. footnote 26.3). 

SOD : For two gambles with the same mean, we say that g > g' when E[u{g)] > E[u(g')] 
for every increasing strictly concave u. For the de Finetti measure, we use the fact that 
u{x) = e~ ax is convex to obtain the same ordering as above and thus the same conclusion. 
For the Bernoulli measure, we use the fact that u{x) = log(l + xla) is concave to obtain the 
same ordering as above and thus the same conclusion. 

26.2.4 Choice under Uncertainty 

Result 9 A less risk averse person invests more into the risky asset. 

By result 5, if is a convex function, then individual 2 with utility cp{u) is less risk averse 
than individual 1 with utility u. The expected utility for individuals 1 and 2 of investing 
a share A into the risky asset are 

UiiX) = I u{\ + r + A(r - r )) h(r) dr 

and 



U 2 m = f 0(k) (1 + r + A (r - r )) h(r) dr 



The FOC of maximization are 



and 



= [/{(A) = / ifr{r)h{r)dr where y/(r) = (r- r )u'(l + r + A(r- r )) 
= U^X) = I y{r)h(r) dr where for) = </>' (u (1 + r + A (r - r )) h(r) 



The change from H to H satisfies the MLRP because cf>' = j- is an increasing function of 
r. Furthermore, (r - r )y/(r) > is true for all r, thus y/ satisfies the SCR Hence applying 
result 12, U' 2 {X) > U[{\) which means that if the RHS is zero for some optimal X\, the LHS 
will be zero for some larger A 2 . ■ 

Result 10 Stochastic dominance 

6 > 6 o E[u{x) \d] > E[u(x) \9] for any increasing utility function u. 

FOD 

6 > 8 oE[ u{x) I d] > E[u{x) 1 6] for any increasing concave utility function u. 



FOD : If u is a simple staircase with u{.) = on ] -co;z[ and u{.) = m on [z;+oo] then 
E[u(jc)|0] = fu(x)h(x\6)dx = m{l-H(z\8)) > m(l - H(z|0)) = f u(x)h(x\6)dx = E[u(x)\d]. It 
then remains to observe that every increasing staircase function is a sum of simple stair- 
cases and finally that every increasing function is the limit of a staircase function with 
many small steps. 

Conversely consider the step function u{.) = over ] -oo,x] and u{.) = 1 over ]x;+oo] to 
get E[u(x)\8] = I- H(x\8). 

SOD : Using integration by part, E[jc 1 8] = f% xh(x \ 8)dx = bH(b \ 8) - SS H(x \ 8)dx - aH(a 1 8) - 
Si H{x 1 9)dx which tends to / +oo (l - H(x \ B))dx - /.^ H(x \ d)dx. Consider the function u(x) = 
min{x, z} and redo the previous computation to find E[u{x) \ 8] = / Z (1-H(x 1 0))dx-/^ oo H{x \ d)dx 
z-S*H(.\6). Hence, E\u{x)\0]>E[u(x)\d]=> f*H(x\8)dx< /* H{x\8)dx i.e., 8 > 8. 

Conversely, A = E [ u(Jc) 1 8] - E[u{x) \8]=S u{x)dH with H = H{x \ 8) - H{x \ 8)). Integrating 
by part, we obtain A = -/ u'{x)H(x)dx since H = at both extremes. Integrating by part 
again, we get A = - uG + / u"G where G{z) = /f^ H(x)dx < by definition of 8 > 8. Since u 
is increasing concave, A > 0. ■ 

Result 11 MLRP (j^ / ) implies first order stochastic dominance (FSD) i.e., H 2 < H\. 

If jf \ was always true then by taking derivative we would have ^ < ^. By definition of 
a distribution, H 2 (l) = Hi(l) = 1, thus 1 > ^(1) > {£(*) by MLRP. We deduce h 2 < hi. Since 
hi ^ ^2, there is a strict inequality for some values so that by integration H 2 (l) < Hid), a 
contradiction. Our initial assumption is not always satisfied, thus there is some y such 
that j^{y) >jf{y). For any z > y, if |p(z) = ^(z) then z is a local maximum of |^, thus this 
curve will not cross the -^ curve since the latter is always locally increasing. L'hospital 
rule states that lim |r(x) = jriO), thus we can conclude that y = i.e., jr < jr everywhere. 

As |p is increasing, the definition of a distribution implies that jf{l) = 1 > jf- ■ 

Result 12 If the family of distributions h{.\8) satisfies the MLRP and / satisfies SCP, 

then8>8^E[f(x)\8] >E[/(x)|0]. 

By MLRP, we see, checking separately for x\y, that U-y)^f{fy ^ ^"J^fnfr is true ! 
further this is strict for some values for otherwise h{.\8) would not be different from 
h{. 1 8). For any function g satisfying SCP, we can replace {x - y) by g{x) in the inequality 
above to obtain 

h(x\8) h(x\8) h(y\8) a 

g(x) — — <g{x) — o g{x)h{x\8) < —g(x)h(x\0) 

h(y\8) 6 h(y\8) h{y\6) 



with strict inequality for some values of x. Hence, E [g{x) \d] = implies E [g{x) \0] > 0. The 
claim is proved by applying this result to the function g equal to / minus the constant 
number E [fix) \ 0\ (if / passes SCP, g also). ■ 

Result 13 A change in distribution of the risky asset increases its demand whatever the 
risk aversion (u) and whatever the risk-free rate (ro) if and only if the change satisfies the 
MLRP. 

Consider a dominant change from Hi to H 2 . By definition of f rQ {r) defined in result 10, 
{r - r )y/ rQ {r) >0 is true, thus f Q l y ro {r) dH x {r) =0=> f x y/ ro {r) dH 2 {r) >0 by virtue of the previ- 
ous result but this means that the optimal share X 2 under H 2 is larger than the optimal 
share A\ under H\. 

Conversely, for any risk-free rate ro, we can apply the lemma below to f{r) = tf ro {r) and 
g{r) = ^(r)f ro {r) and claim that 3a ro ,Vr e [0;1], g{r) > a ro r (r). Developing iff and sim- 
plifying by u', we obtain l^(r) - a r A (r - r ) > i.e., ^ is equal to a ro at r = r and is 
locally increasing to respect the inequality. Since this is true for any r , the ratio is ev- 
erywhere increasing thus -jjf{x) is decreasing i.e., the MLRP holds. 

Lemma: {V H, JfdH = 0^ JgdH> 0} => {3a, g> af over [0;1]} 

Under the premises, the program min f gdH s.t. f fdH = and f dH = 1 must have a 

dH>0 

non negative value. Since the constraints and the objective are linear in dH, the Kuhn- 
Tucker conditions are both necessary and sufficient to characterize the optimum H*: 
Vx £ [0; 1] , g{x) - af{x) - [3 > iidH* [x) ^ where a and f> are the Lagrange multipliers (cf 
§2.2.1). Integrating and using the constraints, we obtain = / (g(x) - a fix) - [3] dH* [x) = 
f gdH* -[3. Lastly, the Kuhn-Tucker conditions imply that Vx £ [0; l],g{x)-af{x) is bounded 
below by [3 = f gdH* > 0. ■ 



26.3 Auctions and Finance 

Result 14 The optimal selling mechanism when marginal revenues are not always in- 
creasing. 

Consider first one potential buyer. Let f{q) = R m [H~\q)) o R m {v) = f{H{v)), let F{q) = 
fa f{x)dx, then let F = max{G, G < F,G convex} be the convex envelope of F. By construc- 
tion, / = F > so that R m (v) = f{H(v)) is increasing. Observe that 

A=( (p(v)(f-f)(H(v))dH(v) = [(p(v)(F-F)(H(v))\l- [ (F - F)(H(v))d<p(v) 
Jo Jo 



when integrating by parts. Since F and F agree at the extremes (by construction of F), 
the first term is zero. The objective of the seller is 

EtyRm] = [ (p(u)f(H(v))dH(v)= [ (p(v)f(H(v))dH(v) + A 
Jo Jo 

= E[(pR m ]- [ {F-F)[H[v))d(p[v) 

Jo 

Returning to the case of several potential buyers, the seller's revenue (cf. eq. 22.10), 
up to a constant, is 

Ws = E[Li>o(p i (v)RmAvO]^E[Li>o(p i {v)Rrn,dv i )]-Z^o^i (26.5) 

where A/ is the generalization of A to bidder i. Since each A, > by construction of Ft, 
the maximum of Ws is reached by maximizing the first term if at this maximum L/>o A*' 
is nil. Allocating, as in the regular case, the item to the bidder with the highest adjusted 
marginal revenue R m> i maximizes this first term but also makes the second nil. That each 
A; = is true is because at every value vu either H,-(v,0 is such that F t = F t or otherwise F t 
is locally linear, meaning that / ; is locally constant, meaning that R m j is locally constant, 
implying finally, that under the proposed allocation rule, the probability of winning the 
item is also locally constant i.e., d(p t (v) = which makes A/ nil. 

Result 15 The optimal strategies in Chatterjee and Samuelson (1982) trade game are to 

? . s+3b 7 ? 3s+b 

announce p\, = %b+ —^ and p s = |s + —^. 
The player's payoffs are 

j m?± if Ps < Pb j *-&+& if Ps < Pb 

n s = < z and %}, = < z 

[ s if p s >Pb [ if p s >Pb 

For i = b,s let piivt) denote i's betting strategy and q\ its inverse (assuming it is increas- 
ing). The seller chooses p s and there is trade if the random b is such that p s < Pbib) o 
qbiPs) ^ b; likewise the buyer chooses p b and there is trade if the random s is such that 
p s {s) < pb o s < q s {pb)> hence each profit is a function of the unknown valuation: 

n s {p s ,b) = { 2 

[ s if qb(Ps) > b 

and 

J b _E^Et if s < qApb) 

n b {pb,s) = \ ■, 

if s>q s {p b ) 



The expected payoffs are: 



(s-s)u s (p s ) = f n s (p s ,b)db = f " ' sdb + f &±E 

Js Js JqbiPs) 

b-b)u b (p b ) = J n b (p b ,s)ds = J [b-^P^jdb 



The FOCs are 



du s _ dq b Ps + Pb(qb(Ps)) dq b f S db _ ( n ) d ^b,l(^ „ f (rt O 



ap fc 



„ = ^ ^_Emf^)^.j^ n, f = {h . Pb) ^ + i {k . %{Pb)) 



Let us look for linear rules ptivt) = <Xi + f3 t Vi o qdpt) = ^-g— L for i = b,s. The FOCs now 
read 

f ^ 1 1 (Ps-oc b _\ a b + p b s 2 2 

. , 1 l(pb-a, ) <Xs + bp 2 2 

[b-p b )- = -[-^--b] o Pi = __ + -^^=- 



We are left with the system 3a 5 = a^ + /3 b s = a b + \s and 3a^ = a s + b{5 s = a s + b\ whose 

i . . . s+3b 3s+b _ 

solution is a b = -j^, cc s - —^> ■ 



Result 16 Proof that a bilateral trading mechanism cannot be at the same time efficient 
in trade, balanced in budget and guarantee the participation of both agents. 

The revenue equivalence formula (22.3) remains true i.e., for i = b,s mil) = Ui(Vs+fl (p t {x) dx. 
As t b {b) = b(p b {b) - u b (b), the overall expected payment is 

E[t b (b,s)]= lb(p b (b)- (p b (x)dx)dH b -u b (b) 

= f (p b {b) [b - lj j$r) dH b - u b (b) (integrate by parts) 
= fjz b {b,s)(b- l -^p-)dH b dH s -u b {b) (26.6) 

where z b e {0; 1} is the buyer's allocation rule in equilibrium of the auction. Likewise, 



using u s {s) = u s {s) - f*(p s {x)dx, we get 

E[t s {b,s)] = JJ z s (b,s)[s + ^]dH s dH b -u s (s) 

= JJ (l-z b (b,s))[s+^)dH s dH b -u s (s) 

JJ z b (b,s)(s+f^)dH s dH b -u s {s) (26.7) 



= s- 



since JJ (s+ gg) dH s dH b = j(s + fg) h s [s)ds = [sH s (s)]l = s. 

Summing (26.6) and (26.7), the expected total payment to the broker is 

A = s- u s (s) - u b (b) + JJ z b [b, s) [b - ^^ - s - fg) dH s dH b 

If the buyer with the lowest value b derives no extra surplus from participating then 
u b {b) = 0; likewise, if the seller with the highest value s derives no extra surplus from 
participating then u s {s) = s. The maximal value for A is thus the integral term. We now 
use the efficient rule z* h = 1 o b > s to show that A < i.e., the broker must bring funds. 
We denote bAs = min {b, s}, we can write A = f b A{b) dH b where 

** s 

= [(b-^^-s)HM]^-(b-^^-bAi)H t »Aii 

_ I - i w H ^ <0 if b< ~ s 

1 b -*- 1 T$r = R'nto-s if b ^' s 

Over [s;b], the average of A is nil because (22.6) applies telling us that s = E[R m [b)\b>s], 
Given that A is negative over [b; s] , A is negative. ■ 

Result 17 Proof that the average price mechanism maximizes the gains of trade among 
budget-balanced mechanism guaranteeing participation of both agents. 

To avoid running a deficit, the trading mechanism must use a rule z b such that A > 0. 
Let f e {s, b) = b-s-el l ~^ b \ ] + f^g and define the transfer rule z e = lf e > which maximizes 
Jf zf e unconditionally. The optimal rule z* solves 

| max z ff zf | max z JJ z (/ + A/i ) 

J s.t. /jfz/i = ' [ s.t. jOT«/i=0 



for some A > 0. Observe then that /o + A/i = (1 + X)f e for e = -^. It thus remains to exhibit 
one e such that g{e) = ff z e f\ = to have our solution. This is done by showing that g(l) > 
(obvious by construction of z\), g(0) < (this because A < for the efficient rule z Q ) and 
that g is continuous which comes from the fact that a change in e, changes the bound in 
the integral through a change in the solution of f e = which is continuous in e. 

With uniform distributions over [0; 1] , f £ is,b) = b - s - e(l - b - s) = (1 + e) ib-s)- e, thus 
z e = 1 iff 5 < b- ^ and g(e) = f\_ J^" 1 ^ (2(b- s) - 1) dsdb = ^f^s, thus e = i leading to 
b - s > \ as claimed. ■ 



Result 18 There is over-investment in the free-cash flow model. 
To prove that claim, we use the fact that R m {k*) = 1 = E[p] to write 

l = R m {k*mp] = Rm{k*)l pdH{p) + R m {k*) pdH{p) (26.8) 

Jo Jp 

The repayment to the lender being b = min{pR{k), (1 + r)k], we can write 

E[b] R(k) fP 



■ [ pdH(p) + (1 + r) (l - Hip)) (26.9) 

Jo 



k k 

If the participation constraint E[b] >kis binding, then observe from (26.9) that 



1< 



rp 
■p dHip) + (1 + r) (1 - H{p)) = Hip) + (1 + r) (l - ff(#) 
Jo 



hence r > 0. 

Next, we use the fact that ^^ > R m ik) to deduce that the first term of (26.9) is greater 
than the first term (26.8). Given that (26.8) and (26.9) are equal, it must be the case that 
seconds terms are inversely ordered i.e., 

(1 + r) (1 - Hip)) < R m ik*) / pdHip) o (1 + r)< R m ik*)E[p\p > p] (26.10) 

Jp 

The risk premium of investors being ft = J- 00 [pRik) - (1 + r)k) dHip), we have 

oc R m ik*)E{p\p > p] - (1 + r) > by (26.10) 



dfi 
dk 



k* 

meaning that the optimal investment k is greater than the efficient one k* 



Notes 



1.1 Since each agency uses its own typology, 
some figures appear widely apart when in fact 
they reflect slightly different aggregates. 

1.2 cf. annual exchange rates. 

1.3 Early versions of this work borrowed from 
the books by Gabszewicz (1994), Rasmusen 
(2006), Shy (1996) and Tirole (1988) as well as 
teaching notes posted on the internet. 

1.4 This, as far as we know, is a novelty at the 
textbook level in micro-economics. 

1.5 Taking an example from the realm of 
physics, Newton's theory of gravity can be seen 
as a model of the trajectory of the apple from 
the branch of the tree to Newton's head. It can 
be applied not only to apples but also to other 
fruits, parachutists, rockets or balls. 

1.6 This means for example that bits from 
quantum physics are irrelevant to build New- 
ton's theory of gravity. 

1.7 Formally, game theory studies competitive 
activities in which participants strife against 
each other according to a set of rules. 

1.8 For instance, behavioral economics, and 
in particular prospect theory, has shown the 
limit to the expected utility theory (cf. §19.1.3); 
its finding regarding non fully rational behav- 
ior are being integrated into the mainstream 
framework. 

1.9 Introductory online courses are available at 
economicsnetwork; for more a more advanced 
level check with introecon or econphd. 

1.10 Business oriented readers may wish to 
consult Dixit and Nalebuff (1991), Branden- 
burger and Nalebuff (1996) or Dixit and Skeath 
(1996) while more traditional introductions to 
game theory with economic applications are 



Gibbons (1992), Binmore (1992) or Osborne 
and Rubinstein (1994). Advanced treatments, 
at the graduate and post graduate levels in- 
clude Fudenberg and Tirole (1991) and Myer- 
son (1991). The website www.gametheory.net 
offers online courses as well as review of text- 
books. 

1.11 An excellent online tutorial is available at 
Martin Osborne's website to refresh your mind. 

2.1 The limitation to only 2 factors is for exposi- 
tion purposes and involves no loss of generality. 

2.2 If the variations dK and dL are such that 
the pair {K + dK,L+ dL) remains in the iso- 
quant, then the total variation of production 
d<P = <£> L dL+<PicdK is nil. A simple algebraic ma- 
nipulation then yields equation (2.1). 

2.3 This term is used in opposition to the mar- 
ket price that, as we shall see, can be deemed 
objective because it depends on all the technolo- 
gies while this one depends on this particular 
technology O. 

2.4 Like any attempt to judge and compare, 
this theory builds on welfare economics (cf. 
§2.3.3) and more specifically on Debreu (1951)'s 
coefficient of resource utilization. 

2.5 The intermediary who provides the subcon- 
tracted activity incurs a fixed cost of maintain- 
ing a large number of employees available to 
meet any of his clients requests; since he bears 
a risk, he must charge his clients more than his 
own marginal cost. 

2.6 The short-term cost is always greater but 
equal to the long-run cost at one point. As can 
be seen on Figure 2.2, the curves touch but do 
not cross, so that their tangents are equal at 
q , that is to say the marginal cost are equals. 



More formally, this result is a consequence of 
the envelope theorem. 

2.7 Letting A = 1 + e for e > 0, the condition 
XC{q) > C{Aq) reads eC{q) > C{q + eq) - C(q) o 
ACiq) = ^f> CU?+£ g- CW -* C m iq). As for the 
second claim, footnote 26.3 shows that the sign 
of ^ is that of C m - AC. 

2.8 H1-H3 are necessary in the short run while 
HA has to be added only if one takes a long run 
perspective. 

2.9 In greater generality, the factor prices w 
and r also hinge on the marginal cost, thus 
profit is a function of the prices of all outputs 
and inputs. 

2.10 Since AC(q) = ^, the FOC defining q 
reads = ^ = fc^M c m {q) = AC{q). 

2.11 One could argue that if the marginal cost 
is always constant or decreasing, the minimum 
efficient scale is infinite but since there does not 
exists a technology that enables infinite produc- 
tion, the marginal cost must start to increase at 
some point, hence a finite value of q exists, al- 
though it may be very large, for instance larger 
than the world demand for a free product. 

2.12 Since total cost can be divided into fixed 
and variable parts C{q) = F + CV(q), the aver- 



age cost has also two parts; 



CV(q) 



has limit zero 



when production tends to zero while f diverges 
as soon as F > which is the case illustrated on 
Figure 2.3. 

2.13 Utility is the economic terminology for the 
satisfaction or pleasure we derive from consum- 
ing goods (commodities or services). 

2.14 We make the necessary assumption for u 
to be differentiate. 

2.15 This is so because either there exists an 
increasing function / with u = f{v) or an in- 
creasing function g with v = g{u). Comput- 



ing the MRS in each case reveals the equality 
thanks to the laws of derivation. 

2.16 Under the necessary assumption that 
make the problem well behaved i.e., having a 
unique solution. 

2.17 Mathematically, this is the Envelope The- 
orem or Maximum Theorem. 

2.18 Beware if that x* does maximize i?(x,A*), 
A* does not maximizes ,S?(x*,A), it is a saddle 
point. Hence Kalman (2009) argues it is a fal- 
lacy to believe that the Lagrange method re- 
duces constrained optimization into an uncon- 
strained one. 

2.19 This statement might be false for basic 
food if the consumer is quite poor, a combina- 
tion that we shall never consider in this book. 

2.20 The exact definition are related to the in- 
come elasticity of demand: inferior if e m < 0, 
luxury if e m > 1, normal otherwise. 

2.21 This revealed preferences reasoning is due 
to Samuelson (1947). 

2.22 Asking Julie's neighbors how many euros 
they would pay for each additional video rental, 
we obtain a long list of integers between and 
some maximum, say 100. Let t%i be the num- 
ber of times the figure i appears in the list and 
n = Ei<ioo n i> the maximum number of rental in 
this neighborhood aka. the market size D(0). 
We can now compute the frequencies h t = niln 
and cumulative frequencies H t = T.j<i hj in or- 
der to express the demand, for an integer price 
p, as Dip) = D(0) x (l - H p ). Passing to the con- 
tinuum in prices maintains this dual view of 
demand and distribution of WTP which will be 
used in §22 on auctions. 

2.23 It is possible to normalize b to unity; this 
is achieved by changing the physical unit from 1 
to b (if b = 4, it amounts to sell the good in packs 
of 4 units) so that the price of one new unit is 



bp. This trick would however prohibit a com- 
parative statics analysis of equilibrium quanti- 
ties with respect to price sensitivity of demand. 

2.24 Obviously, if there is an alternative pur- 
chase that generates a higher surplus then we 
ought to buy this alternative first. 

2.25 His idea arose from an attempt to per- 
form a cost-benefit analysis of public works (cf. 
§4.2.2). 

2.26 When utility is separable between money 
m (an index of other goods and services) and the 
good q i.e., Uj{m,q) = m+ Uj(q), the FOC of de- 
mand is u'Aq) = p which defines the WTP func- 
tion as the marginal utility; (2.17) is a restate- 
ment of this FOC in integral form. 

2.27 We use the letter W as in welfare because 
consumer surplus will be part of the market 
welfare. 

2.28 More precisely, Willig (1976) proves that 
consumer surplus is the right tool to evalu- 
ate the effect of a policy upon a single mar- 
ket. The percentage error made by using con- 
sumer surplus instead of Hicks (1942)'s more 
correct but hard to estimate equivalent varia- 
tion or compensating variation is bounded by 
t^AWd where AWo is the variation of consumer 
surplus brought about by the change of policy, 
m is the income of the individual and r\ his in- 
come elasticity of demand. This bound is likely 
to be small since either the income is much 
larger than the variation of consumer surplus 
or the income elasticity is small. On top of 
this theoretical finding, we can note with Hicks 
that measurement errors will be far greater 
than the conceptual ones because the quality of 
available statistics is often very poor. 

2.29 If we write consumer surplus as a fi- 
nite sum W D (q) = L q x=1 (PM - P{q)) = -qP(q) + 
T. q x= i PW, then the marginal variation is W D {q+ 



1) - W D (q) = -(q + l)P(q + 1) + qP{q) + P(q + 1) = 
-q{P{q+ l)-P{q)) which is the discrete version 
of the derivative in the text. 

2.30 He coined the term "consumer surplus" 
and introduced the remaing concept of this sec- 
tion: the supply curve, the producer surplus, 
the partial equilibrium and the notion of wel- 
fare. 

2.31 If the technology exhibits increasing re- 
turns to scale then any active firm will try to 
produce an infinite quantity because the aver- 
age cost tends to zero (hence goes below any 
positive price). This would make aggregate 
supply greater than demand and permanent 
disequilibrium would result. This particular 
case is studied in §17 that deals with regula- 
tion. 

2.32 Consider producing a total amount q 
for the monopolist. We know from the multi- 
plant problem (2.8) that the optimal distribu- 
tion {qt)i Sn among the n plants satisfies C m {q) = 
C l m {qi) = ... = C^iqn). If we denote p the cost so 
achieved, then the production of each plant is 
her competitive supply i.e., q% = Siip), hence the 
aggregate production q = Y,i< n Qi is exactly S{p). 
We thus proven that C m {q) = po q = S(p) which 
is to say that C m is the inverse of S. 

2.33 This was the original observation made 
by Dupuit (1844) to argue that tolls for bridges 
should be eliminated since their marginal cost 
is almost nil. This author also observed that if 
the current price is already inefficiently large 
like p2 on the right panel of Figure 2.5, then a 
further increase Ap generates a much greater 
loss of p 2 Aq+ \ApAq (or a large gain if we im- 
plement a reduction). 

2.34 Debreu (1951) using the duality of util- 
ity and cost shows that summing consumer sur- 
pluses over markets (the utilitarian approach) 



is a good proxy of an objective theoretical wel- 
fare measure that avoids making interpersonal 
utility comparisons. However, the existence of 
lump-sum transfers must be assumed in order 
to be able to compensate agents who would suf- 
fer from policy changes (cf. Boccard (2010a)). 

2.35 There is tacit agreement in the profession 
since all manuals we have been able to consult 
gloss over the issue. 

2.36 For a unique player, one speaks of decision 
theory. 

2.37 With more strategies, we use the "domi- 
nation" relation to eliminate some strategies. If 
one one remains, it is dominant strategy. 

2.38 For ®, there are 4 cases i.e., one per ma- 
trix entry. For ®, there are 8 cases = 2 players x 
2 possible dominant strategies x 2 possible op- 
timal choice for the other. There are two cases 
for each of the remaining classes, making 16 in 
total. 

2.39 There are two pure strategies equilibria 
and one mixed strategy equilibrium which we 
do not develop. 

2.40 A single valued relationship is called a 
function while a potentially multi-valued rela- 
tionship like BRi is called a correspondence. 

2.41 We are assuming the existence of money 
as a perfect mean to transfer utility (or satisfac- 
tion) i.e., the swapping of l€ between farmers S 
and N decreases the utility of farmer S as much 
as it increases the utility of farmer N. 

2.42 Notice that a simple translation of the 
axes puts the origin at the threat point as if 
the opportunity cost of both players was nil. In 
many applications of bargaining, this is done 
without loss of generality. 

2.43 The traditional economic definition of a 
"quasi-rent" speaks of a payment that is re- 



ceived by a resource of production over the op- 
portunity cost in the short run. 

2.44 When S s = S N , we may interpret 1 - 8 as 
the share of the pie wasted in having the nego- 
tiation lasting one more period. 

2.45 We use the language of agency theory as 
this model is used in Part H. 

2.46 The formal model sets tin = ^nPn^n) - 
C N {q N ) and n s = qsPsiqs) - C s {qs,z) where the 
emission level z is a deterministic function 
/{c/n) of iV's level of activity. If N is not liable 
for z, he chooses cjn solving R^iq^) = C%(cIn), 
thus acting as a Stackelberg leader. In this 
setting, S is a follower and chooses qs solving 
Rm(qs) = C^(<7s.<7n)- Efficiency commands to 
maximize tin + tis in which case the first FOC is 
modified into R%(q N ) = C^{q N ) + ^. The solu- 
tion involves q* N < q^ and q* s > q s ; emissions are 
reduced by f(cjN) - f(q* N ) but never fully elimi- 
nated. 

2.47 Although it looks unequal, an efficient 
agreement sees S aks N to limit emissions to 
z* = fiq^) in exchange of a payment F = n N -n* N . 
Upon accepting the offer, A would produce the 
maximum compatible with the agreement i.e., 
q* N and earn n* N + F = n^ which is enough for N 
to accept the initial offer. In that case, B earns 
n* s - F = tts + 8 i.e., S grabs all the benefits from 
efficiency even though he still has to pay N. 

2.48 We may also cite social cohesion, repu- 
tation and repeated interaction as mechanisms 
used to achieve cooperation. 

2.49 In the light of the modern theories of com- 
mitment and reputation, we can make sense 
of this last argument: the individual will bond 
with his peer group when he recognizes that 
short-term IR behavior will be met with a large 
sanction such as lasting ostracism. Hence, the 
social dilemma is solved if the group has the 



means to avoid free riding. 

2.50 From the standpoint of modern game the- 
ory, this position is shaky for if incumbents find 
it beneficial to integrate the cartel, couldn't it 
be best for an entrant to apply also ? 

2.51 Tuck (2008) ascribes this modeling choice 
to the 1791 revolutionary Le Chapelier law 
that outlawed combinations, associations and 
guilds. 

3.1 Interestingly, he also reports that city 
states already sell monopoly franchises to raise 
finance (cf. §16.4.1). 

3.2 It might be the case that the challenger will 
displace the incumbent and become itself the 
new monopoly. 

3.3 Microsoft's loud speaking CFO, Steve 
Ballmer, once said "We don't have a monopoly; 
we have market share, there's a difference", a 
thin one we shall argue. 

3.4 The terminology derives from rents earned 
by landlords. 

3.5 Mathematically speaking, the volume effect 
is of first-order while the price one is of second- 
order. 

3.6 Since P is the inverse of D, their derivative 
are inverse one from the other. 

3.7 Equation (3.4) reads [p - C m )e = p o {p- 
C m )P{q) = -pqP'(p) o p-C m = -qP'(p) o C m = 
R m because p = P{q). 

3.8 By the first and second theorems of welfare, 
a Pareto optimum is also a competitive equilib- 
rium but since there is only one firm, the mono- 
poly, it sounds weird to pretend it could act as a 
price-taker; for that reason we prefer to speak 
of Pareto optimum or an efficient allocation of 
resources. 

3.9 Dupuit (1844)'s original "utilite perdue" 
(lost utility) is turned into "perte seche" (net 



loss) by Colson (1901-1907), then translated 
into "dead loss" by Edgeworth (1910). This area 
is also known as Harberger (1954)'s triangle for 
the stubborn effort of this author to measure 
empirically this loss at an aggregate level (cf. 
Hines (1999)). 

3.10 A still older case seem to be an Egyp- 
tian regulation prohibiting papyrus from being 
planted in too many places. 

3.11 Reacting and adjusting to ever changing 
external conditions is time and effort consum- 
ing while the reward is meager because at the 
maximum, profits change slowly (this is due 
to the envelope theorem); the firm owner may 
thus rationally permit some slack in the search 
for maximum profits. 

3.12 The neutral name is chosen to combine all 
forms of inefficiency that are not allocative i.e., 
not due to the pricing behavior of the firm. 

3.13 The exception would be when the qual- 
ity attribute is "shinny" and signals the over- 
all high price of the item whose ownership then 
serves as a social signal (cf. §24.5). 

3.14 This claim cannot be transposed to a sym- 
metrical statement contingent on quantity be- 
cause the consumer surplus depends on qual- 
ity in two non comparable ways as C{s,q) = 
f«[P(s,x)-P(s,q))dx. 

3.15 Starting from the quantity-quality pair 
which is optimal for the monopoly, the planner 
can firstly increase quality to the efficient level, 
conditional on the monopoly quantity. Then 
he starts to increase quantity while adjusting 
quality downward because the two are substi- 
tutes. If the total increase in quantity is large 
enough, then quality will fall so much that it 
will end up lower than the original monopoly 
level. 

3.16 This is w.l.o.g. if the cost of developing 



and implementing quality are convex because a 
rational choice, guided by profit considerations, 
will always be less than what the current state 
of technology would be able to achieve. 

3.17 Notice that our example falls outside the 
previous categories as f-§^ = 0. This separabil- 
ity of q and s in the WTP will however enables 
to work out an analytical solution. 

3.18 To do this the firm must use carriers of 
minimum size qls. To simplify, we assume that 
demand is homogeneously spread over the pe- 
riod so that each carrier can travel at 100% ca- 
pacity. 

3.19 This does not contradict the general result 
enunciated above because it was contingent on 
price while here the contingency is quantity. 

3.20 There is a slight abuse of language here 
since we do not account for capital cost (carrier, 
6) but only variable cost (people, c). 

3.21 cf. One concrete example is given in 
Markoff (1990), p449, who concludes that at the 
end of ancient regime France in 1789, the crowd 
voted in favor of taxes in return for effective 
property rights and public services but strongly 
opposed lump sum payments to racketeers such 
as Farmers-General. 

3.22 Both use coercion over individuals to en- 
force compliance but with varied levels of phys- 
ical violence. 

3.23 Brennan and Buchanan (1977), and the 
public choice literature after them, refer to the 
ruler as the Leviathan in homage to Hobbes 
(1651). 

3.24 A net tax rate of 50% has two dimensions. 
Firstly, it may be the outcome of taxation at 
80% whereby 30% of the proceeds are used to 
enforce taxation i.e., quash revolts and tax eva- 
sion. Secondly, anticipating such a high level 



of taxation, economic agents substitute part of 
their time towards non taxable activities which 
are inherently less productive. 

3.25 The first best situation implicitly assumes 
full cooperation among consumers-producers 
and the ruler for decision making and assume 
away wealth effects. In that case, it is vi- 
able to finance public goods via a lump sum 
tax t so that the people's utility is U = (1 - 
p)D{p,s) - t whereas the ruler's net wealth is 
n = t+ pD{p,s) - s. Welfare W = U + n is thus 
as presented in the text. In this ideal setting, 
the ruler is a civil servant collecting taxes to fi- 
nance public goods. 

3.26 We assume R {p°)<S>' (0) > 1 to avoid a corner 
solution. 

3.27 As Rat.) > R i), we have R (p°) < R a (p°) < 
R a ip a ) because p a is the maximizer. Now, since 
O' is decreasing, the result obtains. 

3.28 Since p a < p°, the relevant range is [0;p°] 
over which pr(p) is increasing. 

3.29 It is also the solution of p a r(p a )$>{s a ) = 
s a o a= y^-jj after simplifications. 

4. 1 This is because, under uniform pricing, the 
relationship between quantity and total price is 
linear in the mathematical sense. 

4.2 It shall be proved later that it is better to of- 
fer the same conditions to anyone within a seg- 
ment than randomly experimenting. 

4.3 They need not be so temperamental; as ra- 
tional economic agents, they have an incentive 
to invest time and money to avoid, by whatever 
means, the higher price they are being asked to 
pay 

4.4 Under the racial discrimination of 
apartheid, blacks were barred from using reg- 
ular public transportation and were forced to 
use separate carriers; the service being worse, 



it was as if they had to pay more to use the regu- 
lar bus. Under economic discrimination, the re- 
verse would happen: whites would have to pay 
more because their skin color reveals they be- 
long to a wealthier group whose WTP is higher. 

4.5 Article 13 of the EU Treaty allows the EC to 
take measures to combat discrimination based 
on sex, racial or ethnic origin, religion or belief, 
disability, age or sexual orientation. 

4.6 There is also a cost explanation to this pric- 
ing since the frequency of accidents is higher in 
urban areas. 

4.7 To be sure, the manager can abate all prices 
by one cent in order that each rental generates 
a surplus of 1 cent. This makes each rental 
strictly superior to not renting for the client. 

4.8 Club goods are studied in §5.3.2. 

4.9 This is mostly a theoretical view since a 
firm can hardly sustain the reputation of mak- 
ing one and only one offer and walk away from 
profitable trade should it be refused. 

4.10 cf. also footnote 26.3 showing that bilat- 
eral relationships are efficient when enough in- 
struments are available. 

4.11 For the case where the marginal cost is 
increasing, we have to determine the efficient 
total quantity Q* and use c= C m {Q*) in the pre- 
vious schemes. 

4.12 He also proposes as an alternative, to 
lower the tariff between 7h to 8h and between 
17h to 18h which are the peak periods where 
the large worker population crosses the bridge 
to reach factories and go back home. In §25, 
we deal with seasonality and show that it gen- 
erates congestion which in turn results in an 
inefficient use of the economy's scare resources. 
We also review the solutions to this inefficiency. 

4.13 We drop the "M" superscript label and use 



instead "d" and "u" to indicate optimal choices 
under, respectively, discriminatory and uniform 
pricing. 

4.14 This is a revealed preference argument 
quite common in economics. 

4.15 When marginal cost is not constant, we 
obtain q% = — ' 2 " ,( ' ?o) and summing, we get 
C m {qo) = a \ q " whose unique solution qfi 1 deter- 
mines the optimal discriminatory scheme. 

4.16 As is often the case, competition among 
retailers is more intense in the integrated US 
market as compared to the scattered european 
one. 

4.17 Because a second hand market for re- 
trade would be active proving that the original 
allocation was not a Pareto optimum i.e., ineffi- 
cient. 

4.18 If the firm can manage to sell exclusively 
to the strong segment, she earns 5/4 which dom- 
inates uniform pricing if 8 < |. 



4.19 As we have seen in other instances, 



dT 







at the optimum T* and W = n + W D thus fy = 
^ = d[T, p 2 )-d(T, pi) > Oat T*. 

4.20 Affirmative action, though akin to posi- 
tive discrimination, works through direct reg- 
ulation. The government imposes a minority 
quota on institutions who then use positive dis- 
crimination to motivate the participation of mi- 
nority members in order to meet their quota. 

4.21 The cheapening of automated (computer 
driven) systems generated by the IT revolution 
has reduced the cost of billing as well as the 
client's inconvenience so that we see more and 
more firms introducing complex schemes of dif- 
ferential pricing. 

4.22 Regarding the consumer, under BOGO he 
spends 18€to get 21€ worth of pizza, whereas 
if he buys only one unit at price p, he gets 



14€ worth of pizza plus 18 - p€ left, thus BOGO 
hurts him only ifp<14 + 18-21 = ll. 

4.23 The exact condition is f 1 > p M o L>i(p M ) > 
which must hold true given that we computed 
p M under the implicit assumption that both 
groups are consuming. 

4.24 The design of the optimal three-part tariff 
is not over yet because the monopoly must now 
optimize the (F, A) combination. 

4.25 From W Dill [p) = J™D^x)&x, we get g = 
/r^dx = F'-^dx, hence e M - «J = 



/~e„ (*)£>,, (x)dx . 



f™D^{x)dx 

ties. 



is an average of income elastici- 



4.26 This observation also known as the rev- 
elation principle is studied in §21 on adverse 
selection. 

4.27 We do not consider pricing lists since they 
are absolutely equivalent to two-part tariffs in 
this setting. 

4.28 One could think of using two-part tariffs 
and lowering F h , yet this will be useless unless 
one unit price, either pi or p% is changed. 

4.29 As we are dealing with marginal changes, 
one must imagine extremely thin trapezoidal 
areas. 

4.30 Personalized prices depend on past ac- 
tions in an implicit rather than explicit way. 
Sophisticated and experienced consumers are 
able to predict the effect their actions will have 
on their subsequent deals, and adjust their be- 
havior accordingly. Dimwits or first-timers fail 
to account for this linkage and allow firms to 
exploit their myopia. 

4.31 Switching cost are also what keeps us with 
a supermarket, a gas station, a mobile phone 
brand, a car brand or any other dealer service 
we regularly use (cf. §5.1.2). 



4.32 Care must also be taken to show that 
P\ < p M was not optimal. 

4.33 The firm faces twice the same demand 
thus repeats the optimal one shot price what- 
ever the degree of sophistication of consumers. 

4.34 Recall also that the firm does not wish 
to use commit power to discriminate inter- 
temporally 

5.1 Buyers go to the cheapest price forcing the 
dear firm to reduce her price if she wants to sell 
a positive quantity. 

5.2 Rename the two farmers a and /3, derive 
the best reply of a facing (5 and then set alter- 
natively a = 1,13 = 2 and a = 2, p = 1 to obtain the 
two best replies. 

5.3 The mathematical definition that general- 
izes this example is due to Nash (1950) and hold 
this author's name to distinguish it from other 
forms of equilibrium. 

5.4 By the first welfare theorem, the same wel- 
fare would obtain if the two firms were behav- 
ing as price takers. 

5.5 Obviously, a "first-time" user must incur a 
start-up cost but in all likelihood, it is larger 
than any future switching cost. 

5.6 Even for two identical products, we tend to 
adapt our preferences towards what the one we 
know in order to reduce cognitive dissonance. 

5.7 The profusion of suppliers is due to the very 
small cost of entry; it is enough to set up an e- 
commerce solution (cf. §6.1 on entry). 

5.8 In the example where D{p) = 1 - p and 
marginal cost is c, we have p c = ^^ so that 
the limit where the price starts to decrease is 

8=^± 

u 3 . 

5.9 Although the same phenomena took place 
in Europe, the fragmentation of this market 



makes data collection more difficult. 

5.10 All the data from Table 5.1 was gathered 
in the news section of the CNET website. 

5.11 Footnote 26.3 proves that this is the rele- 
vant case for the equilibrium analysis. 

5.12 If prices are too large then the indiffer- 
ent consumer x has a negative utility from con- 
sumption i.e., the individuals living in the mid- 
dle area do not consume and the market fails 
to be covered. Theater A's demand is then the 
address solving u A {x) = i.e., x= ^ z f A - Profit is 
then proportional to (pa - c)[p- pa) leading to 
an optimal price ^ and sales of ^f. A suffi- 
cient condition for full market coverage in equi- 
librium is p > c+ 1 in order that each firm is will- 
ing to monopolistically serve more than one half 
of the market. 

5.13 In the case where cost are asymmetric, 
profit is 7i A = Yt^PA~ c A)it+PB-PA), thus the best 
replies are 2p A = t+ c A + Pb and 2ps = t+cg + pa, 
so that the equilibrium is p* = t + 2Ca 3 +Cb . The 
margin is p* A - ca = t + Cb ~ Cj4 and the equilibrium 
profit is Ji* A = Yp (3 1 - c A + c B ) 2 = 2 tq\. 

5.14 This is without loss of generality. In- 
deed, the efficient quantity is the demand when 
prices are set at marginal cost c i.e., q* = a- be; 
thus we can interpret a as the efficient quan- 
tity 

5.15 i = ^f|§^ < b + d o bib + 2d) - [b + df = 
-d 2 <0. 

5.16 This model remains ad hoc since the pro- 
cess of writing and negotiating contracts with 
potential clients is not tackled. 

5.17 Recall that by definition of the competitive 
supply which equates price to marginal cost, 
units beyond that point generates losses. The 
rational behavior, if acceptable for consumers 
and authorities, is to ration demand and serve 
at most the competitive supply. 



5.18 The laws of physics force the produc- 
tion and transportation of electricity to match 
instantaneously the household demand which 
is highly variable due to its correlation with 
weather conditions. 

5.19 We choose C,„(0) = w.l.o.g. because it 
amounts to study profit margin p - c instead of 
price. 

5.20 If a is very small, the equation has no so- 
lution which simply means that the market re- 
mains inactive; any price is then acceptable. 

5.21 The other root being negative, the result- 
ing supply function would be decreasing which 
is not permitted. 

5.22 Indeed, p c > 



■,SF 



aq+bS) 



p~ > p-" o b{3+bS) > 2 p +l 



P> T£ib *> Vb 2 + 4b/8 > i% + bo b 2 + 4l > 
(jrk + b) 2 <>0< (b 2 + 4l)a + 8bf-b 2 (3 + 8bf = 
4 1 which is true. 

5.23 A club shares similarities with a pub- 
lic good but also differences as there is volun- 
tary membership, congestion, exclusion of non- 
members, billing of members and collective de- 
cision of the size of the club (cf. Sandler and 
Tschirhart (1997)). 

5.24 Even on the highway, we feel more secure 
from having mild traffic around us. 

5.25 The Samuelson condition for a public good 
states that the marginal cost should equate so- 
ciety's WTP which sums all member's WTPs. 
Here, we have a "local" public good since only 
club members enjoy the service. 

5.26 There is a solution because ^P tends to 
zero as n grow large while u n does not. 

5.27 When clients switch from a low utility club 
A to a high utility one B, they increase conges- 
tion at B thus reduce the utility there while 
they reduce congestion at A which increases 
utility there. Free flowing among clubs thus re- 



duces the utility differential (cf. manna trick 
§2.1.1). 

5.28 With reference to the classical Edgeworth 
box, the firm and the client trade over good 
and money. Because price is not involved in 
such a barter, a Pareto optimum (on the con- 
tract curve) is characterized by equality of the 
RMSs i.e., marginal cost equal to WTP. Money 
is then available to share the gain from trade in 
any proportion (cf. also §4.1.3). 

5.29 The curve -u n -^zi is above -u n thus cuts 
^=r sooner. It must be pointed out that since we 
treat m as a continuous variable, the free entry 
and efficient number of clubs may coincide. 

5.30 Although a regime of perfect competition 
is in complete contradiction with the minimum 
size of clubs, we may nevertheless check that 
it is conducive to efficiency. Indeed when clubs 
are utility-takers, the market clearing equation 
is simply u{k, n)- f = u. This leads to u„{k, n) = 
J^ and a fee exactly equal to the congestion 
cost. The free-entry equilibrium then yields the 
Pareto condition for club size: TWTA = AC. 

5.31 This can be time (phone, radio, TV), soft- 
ware, consumables such as ink, repair services, 
upgrades, and other products and services that 
may be related to the intensity of product us- 
age. 

5.32 In a fully specified model of competition, 
demand depends on the strategies of all active 
firm. 

5.33 We implicitly assume that usage volume is 
independent of income i.e., the marginal utility 
of income is constant. 

5.34 Because we have Df = VfX cte. 

5.35 A correct game theoretical analysis of 
oligopolistic interaction of price discriminating 
firms must take as strategies the very tariffs 



that are offered by firms to customers. Spulber 
(1979) fails that standard when setting output 
as the strategic variable; his proof that perfect 
price discrimination is efficient in oligopoly is 
thus incorrect. 

5.36 This shall never happen in an Hotelling 
model since we always assume complete cover. 

6.1 In practical applications, the expenses on 
governmental licenses ought to be left out since 
they merely represent a transfer among eco- 
nomic agents. The pointillist may count them 
at the marginal cost of public funds (cf. §17.1.2). 

6.2 Since there is no reason why this number 
should be an integer, the long-term number of 
competitive firms is in fact the integer part of 
n* while the long-term price and the long-term 
individual supply are slightly larger than p and 
q. 

6.3 For any other market behavior, the indirect 
effect is not nil. 

6.4 Since the market size is finite and demand 
goes to zero for very large prices, market sales 
are limited by some maximum volume V, so 
that for n > V IF, all firms make losses which 
implies that the process of free entry has a fi- 
nite limit. 

6.5 The multiplier is exactly 2 when the tech- 
nology has constant returns to scale (cf. eq. 
5.12) and is slightly greater when there are 
decreasing returns to scale, since it can be 
shown that the equilibrium price obtained in 
(6.8) yields the profit margin p c n - c = ^ ^^ 
which increases when a and b duplicate. 

6.6 A more complex theory would be needed to 
assess which ones are to leave first. 

6.7 The issue of quantity competition in the 
presence of asymmetric technologies is treated 



in §5.1.3 where it is shown that firms with the 
better technology increase their market share 
and profits upon integration. The same forces 
are at work but without such an extreme out- 
come. 

6.8 A lower price means greater demand thus 
greater market sales; even-though individual 
firm sales might be smaller, this effect is more 
than compensated by the entry of new firms. 

6.9 The French licensee Bic Cristal also did a 
fortune with a cheap but reliable version. 

6.10 Baumol (1982)'s buoyant presentation of 
contestability as an uprising against heavy reg- 
ulation must be interpreted in light of the polit- 
ical momentum of the first Reagan administra- 
tion and its wave of deregulation (cf. §16.4.4). 

6.11 Prices depart from marginal cost to cover 
the fixed cost (cf. §17.2). 

6.12 An alternative hypothesis yielding the 
same conclusions is stickiness (H2'): the incum- 
bent cannot adjust its price as entry occurs. 

6.13 The idea goes back to Bain (1956) who 
states that "most analyses of how business com- 
petition works and what makes it work have 
given little emphasis to the force of potential 
or threatened competition of possible new com- 
petitors." 

6.14 Furthermore, as we explain in §10.1, this 
behavioral assumption is incoherent within a 
model of interaction between rational decision- 
makers. 

6.15 PC is like Cap Horn, a site reachable from 
the pacific, the atlantic or the mainland, far 
from Mexico but still connected to it. Contesta- 
bility, on the other hand, is like Easter Island, a 
place filled with symbols of a long gone civiliza- 
tion, alien to ours and useless to understand it. 



6.16 We find 20 entries for the period 1980-89 
and only 4 thereafter in JSTOR with "contesta- 
bility" or "contestable" in the title. 

6.17 Baumol et al. (1982) initially show con- 
tempt when speaking of "the reaction func- 
tions and the other paraphernalia of standard 
oligopoly model". Later on Baumol and Willig 
(1986) more humbly recognize that "contesta- 
bility... accomplishes its objectives via a pro- 
cess of simplification, by stripping away all bar- 
riers to entry and exit, and the strategic be- 
havior that goes along with them both in the- 
ory and in reality". Regarding policy recom- 
mendations, they backpedal when "vehemently 
disagreing with a view of the world associating 
... contestability with an all-pervasive laisser- 
faire position on the role of regulation and an- 
titrust". 

6.18 Temin (1997) recalls that Baumol was the 
chief economist for AT&T; he gave testimonies 
to the FCC from 1966 until 1996. Schechter 
(1996) recalls how he argued for theories and 
cost allocation formulas (ECPR) that benefited 
his employer, even though he was aware it 
amounted to blocking entry. 

6.19 Microsoft reveals for 2004 that the "Win- 
dows" and "Office" divisions ("client" and "In- 
formation Worker" in Microsoft jargon) display 
rates of profits of respectively 75% and 72% 
which is far above the one digit values observed 
in the vast majority of markets (cf. also CNET 
report). 

6.20 Whether the technology fits into the gen- 
eral plans of Microsoft and will effectively be 
used is of secondary importance. 

6.21 The threshold for the fine F is nf [q[,qf) - 

n C ( n S n S\ _ (_9_ _ II (a-bcf _ {a- be) 2 _ (<7i-qf) 2 T 
u l VH\'H2i ~ 1.64 8> b ~ 64fo ~ b ' 10 

estimate this amount we can use e the elastic- 
ity of demand, Q the actual total sales, p the ac- 



tual price and Aq the number of rationed con- 



sumers to derive F 
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Notice how it de- 



pends quadratically in the rationing created by 
the nonfulfillment of farmer 1. 

6.22 The first truly forward market was estab- 
lished by the Chicago Board of Trade in 1851 
for corn. There are now hundreds of forward 
markets all over the world. 

6.23 A forward contract is a derivative secu- 
rity defined by three items: an underlying as- 
set such as oil or the equity of a firm, a forward 
price and a maturity date. The "long" side has 
the obligation to buy the asset at the agreed 
price on the agreed date; the "short" side has 
the reciprocal obligation of delivering the asset 
to receive the payment. 

6.24 If there is agreement on a price of 10€ over 
17 units, the seller will pay an additional 34€to 
the buyer if the spot price is 12€ while the buyer 
will pay an additional 51€ to the seller if the 
spot price is 7€. Each has thus insured him- 
self the ability to transact 17 units at the price 
of 10€, whatever the future spot price. 

6.25 A participant to the market who is neither 
a producer of the good nor a final consumer. 

6.26 We assume that all transactions, forward 
and spot, are settled in the second period so 
that no discount factor is needed. 

6.27 We use the model of Bertrand compe- 
tition of §5.2.3 where the demand addressed 
to firm i is qi = a- bpt + dpj; we denote nt = 
piqi. With a forward position f, the profit 
is Ttt + (T; - pt)fi and its maximization leads to 
Pi = BRi(pj) s ^4^; note that d -§^ = f b > 0. 

As ft = S = It ~ P b and S| = P d > °> we de- 
duce j^ = | (qi-fi)- Since there is no arbi- 
trage in equilibrium, t, = pt holds true so that 
total profit is %i. The FOC for eliciting for- 

ward sales is 0=t = t(fe + lt^)- = 



fi+f b i(<n-fi)^°>fi=^>-<ii- 

6.28 Recall that owners empower managers to 
take market related decisions because they lack 
information regarding demand and costs. Own- 
ers would like to make sure that managers will 
act so as to maximize profits. Chapter 13 and 
Part H analyze this difficult topic. 

6.29 The absence of variables relating to the 
competitor can be explained by the difficulty 
with which this information could be obtained 
and above all by a legal requirement from anti- 
trust authorities to avoid collusion (cf. §9). 

6.30 If c is very small then the optimal choice is 
simply at = i.e., the manager maximizes sales 
without taking care of cost. 

6.31 This is not a result but a coincidence due 
to our linear modeling; recall that forward sales 
distort market size while managerial compen- 
sation distorts cost. 

6.32 Under integration wt = so that raising 
it a little bit is profitable since fg = §g£ = 

6.33 The manufacturer might even try to use 
negative prices by giving for free other services 
to the retailer (but on a quantity basis to main- 
tain the sales incentive. 

6.34 This might be incorrect if bankruptcy gen- 
erates a deadweight loss. 

6.35 cf. §23 where our financial terminology is 
introduced. 

6.36 Formally, using compact notations for 
partial derivatives, equation n{x,y,9) = D im- 
plies B x = -*£, thus S xx = f™n xx dH(a) + x n x = 

2 

fe° n x% dH{a) - ^ < assuming as usual that 
7l xx < 0. Now, X D 
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>0. 



6.37 This is the famous irrelevance theorem of 
Modigliani and Miller (1958). 



7.1 It remains puzzling that standard text- 
books still offer no unified treatment of ri- 
valry when the research literature is burgeon- 
ing with such models applied to all areas of mi- 
croeconomics. 

7.2 Konrad (2007) and Garfinkel and Skaper- 
das (2007) offer surveys of contests and con- 
flicts respectively. The first known mathemati- 
cal analysis of conflict is Borel's Colonel Blotto 
game regarding the optimal allocation of forces 
among war theaters. 

7.3 Recall that in a conflict, the rules of en- 
gagement are determined by the existing envi- 
ronment and the winner's bounty is less than 
the loser's prejudice i.e., wealth is destroyed by 
conflict. 

7.4 Thief have an opportunity cost of time, thus 
socially waste their effort at becoming better 
thieves. Likewise, victims, and the State de- 
fending them, invest into private and public 
protection. 

7.5 Tullock (1975) originally introduces the 
probabilistic lottery for litigation between two 
agents. 

7.6 As shown by Boccard (2010b), a CSF satis- 
fying independence wrt. irrelevant alternatives 
and consistency with the removal of a dummy 
player must be separable. If we further require 
currency independence then the influence tech- 
nology must be a power function (as used be- 
low). 

7.7 When h is convex, the party having in- 
vested slightly more has a disproportionate ad- 
vantage to win the prize. This is a realistic as- 
sumption for violent conflicts but not so much 
for economic ones. 

7.8 At this point, we may reinterpret the pure 
influence lottery of (7.2) as the unique equi- 



proportionate success function i.e., such that 
Q- = f 1 is true for any pair of participants. 

7.9 For the record, if a = 2, (7.2) reads e\ei = 
(e - e{)Vi thus e,- = -^-. Summing over all par- 
ticipants, we derive e as the unique solution of 
the equation 1 = X ,■ - S J —. 

7.10 Recall that total effect is §g = % + *£§£. 

7.11 cf. Cooter et al. (1982) for an early exposi- 
tion of this issue in the presence of asymmetric 
information. 

7.12 Stoft (2008), in the context of carbon tax- 
ation, speaks of an untax to emphasize the dis- 
connection to usual taxes. More on taxes as in- 
centive instruments in §18.1.4. 

7.13 Oddly enough, Samuelson (1947)'s text- 
book uses this colorful reference to illustrate 
the first and foremost principle of neoclassical 
economics, namely that people face tradeoff. 

7.14 cf. Vahabi (2005) for a recollection of 
the theory developed in this hard-to-find book. 
Another forerunner is Bush and Mayer (1974) 
state of nature model where private property 
does not exist (cf. §26.1.5). 

7.15 We use the vocable " creation" instead of 
" production" in order to be able to speak of " 
productivity" for both creation and capture . 

7.16 In Hirshleifer (1988) and the literature af- 
ter him, Aj is an index of creative productivity 
instead of being a negative index of influence. 

7.17 The rent-seeking FOC is ^ = A<of 
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7.18 Letting \i i and (p t respectively denote the 
capture and creative productivities, the equa- 
tions remain the same with A, = — . 

Pi 

7.19 In the case of violent conflict, it corrob- 
orates the observation that warlords have al- 
ways enjoyed more wealth than laborers and 
also that war technology has always been at 



the vanguard of a society's advancements. The 
latter stems from the fact that a marginal im- 
provement can yield disproportionate returns 
when conflict technology is highly non linear. 

7.20 Uniqueness arise from jM being decreas- 
ing whereas the fact that it is convex implies 

l M l+ i l >Y = a> M l+ f^P thus we 

have e* Q < '""^"^ = (n - 1) ! L '^' (the solution to 
a = Ms lesser than that of b = ^). 

7.21 If the appropriation process was insensi- 
tive to influence then parties would not try to 
influence the result and full efficiency would 
be achieved. To see this in the duopoly case, 
it is enough to change the lottery into pi = 
1 ~<P + (Pyty- We have totally insensitivity to 
influence for (p = and the pure lottery for <p= 1. 
Performing the calculations, we obtain q = ^r. 

7.22 In violent conflicts, parties often settle to 
avoid destructive war but it is clear that this is 
an "armed peace" or a "cold war" equilibrium. 

7.23 This widely cited paper uses an analyti- 
cal framework quite different from the canoni- 
cal one and also less straightforward. 

7.24 There is a clear similarity with the "busi- 
ness stealing" effect in the Hotelling model 
where gaining one client is necessarily the loss 
of one for the competitor (cf. §5.2.2). 

7.25 This phenomenon is quite similar to 
Hotelling competition. As shown in footnote 
26.3, when firm A improves its technology (low- 
ers marginal cost c A ), it lowers its price and in 
response so does firm B. In the new equilib- 
rium, both firms are more aggressive, but the 
innovative one enjoys a greater profit at the ex- 
pense of the other one. However, the change in 
market shares is only one sixth of the cost im- 
provement since x = \ + Cb ^ a ■ 

7.26 Muck like in the Tragedy of the Com- 



mons where inefficient over-exploitation ceases 
as soon as property rights are established and 
allocated to a monopolist. 
7.27 Alternatively A includes the government 
and civil servants who are faithful to the politi- 
cian since they owe their job to his safekeeping 
power. 

7.28 If, as a matter of example, we let 
G B {9) - G A {8) = 9/p, then 0{p) = \ip. Since 
n B = v B H[p(e B - e A )) - \X B e 2 B , the FOC is 
h{p{e B -e A )) = tef. As n A = v A {l-H{pp))- 
\X A e\, the FOC is h[p(e B -e A )) = ^f. Com- 
bining, we get X B v A e B = X A v B e A , very much 
like the cost minimizing capital labor ratio for 
a Cobb-Douglas technology. Substituting in 
either FOC, we obtain the equilibrium levels 
of pressure as the solution to h{ae A ) = ^f^ 
with a s pi^f- - l) (assumed positive to make 
sense). A worsening of lobby i's technology 
(A; /) leads to e* \, e* /, n* \ and n* /. 

1 J ' J 

7.29 The best reply 4> B {e A ) satisfies <p B {0) > 
and has slope lesser than unity because a one- 
to-one response only maintains net pressure 
while being costly. The best reply (p A {e B ) sat- 
isfies <p A {0) = and has slope lesser than unity 
as before. As a simple geometric representation 
shows, there is an equilibrium with e* > e* A > 
0=>p A >0=>8* = f)(p A )>0. 

7.30 Formally, this lottery is identical to the 
differential productivities form alluded to in 
(7.1). 

7.31 Consider an externality where parties 
haggle over the level q emitted by B (value 
function U) and weighting on A (cost function 
C). Whereas B would like to pick a large q, A 
would like to pick a low level q. We thus have 
V B = U(q) - U{q) and V A = C{q) - C(q). 

7.32 Since Ay = 1 - A,, the numerator of k* de- 
creases while the denominator increases. 



7.33 Recall that in a world of zero transaction 
cost, initial ownership does not matter. 

7.34 For the externality, W* = max q U[q) - C{q). 

7.35 An integration by parts shows that the ex- 
pected discovery time is the inverse of the haz- 
ard rate i.e., E[t] = j±q. 

7.36 The density Pr(r = t) = h(x)e- h{x)t satisfies 
fgze~ zs ds=l-e- 
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7.37 This is a consequence of the mixed strat- 
egy equilibrium followed by firms. 

7.38 Observe now that n w {w s ) + n i{w s ) - 
n n {w s )-7i 00 (w s )= c( f_ ~ qo) > thus 7i w (w s )-c> 
7io {w s ) + 7i u (w s )-c-7i 01 (w s ) > jtoo(^ s ) by (7.14), 
hence we have shown that working dominates 
shirking when the other shirks. It is therefore 
impossible that both agents shirk and if one 
works the other prefers to work which proves 
that w s implements mutual effort at minimum 
cost. 

8.1 Absence of rights is often characterized by 
an open access regime. 

8.2 For natural resources and environmental 
assets, one may hold a right to use, a stronger 
right to harvest or even the right to manage the 
asset. 

8.3 The driver seem to be reciprocity, a univer- 
sal, spontaneous, and evolutionary fit behavior. 
Experimental economics has shown that sub- 
jects tend to cooperate in the prisoner dilemma 
(as well as other social dilemmas) and that vol- 
untary contribution for public goods are much 
larger than static models of game theory would 
predict (cf. Camerer and Loewenstein (2002) or 
McKenzie's contrarian point of view). 

8.4 One could argue that the incumbent values 
the asset highly because he invested much in it 
but this is a sunk cost fallacy. Indeed, to max- 
imize future returns, one ought consider only 



future payoffs of the asset irrespective of how 
much was expended on it in the past. 

8.5 In the language of game theory, first posses- 
sion is a correlation device that bring parties to- 
wards a correlated equilibrium without conflict 
(cf. bourgeois strategy in the hawk dove game 
or "tit-for-tat"). 

8.6 Shleifer (2010) is an excellent read, all the 
more given the "laissez-faire" stance of this au- 
thor in his early days. 

8.7 At a different level, the world trade or- 
ganization (WTO) is an arbitration court with 
no formal authority upon countries whose de- 
cisions are however upheld by all. The reason 
here is the powerful deterrence effect of reputa- 
tion; if a country refuses to abide, others will re- 
taliate either directly with import taxes or im- 
plicitly by losing confidence. 

8.8 In the ideal world of complete contracting, 
there is just one auditor and one judge for an 
entire country who sit idly since no one ever in- 
fringes the law. Their presence is nevertheless 
necessary to guarantee that any infraction will 
be detected, processed and fined. 

8.9 The same applies for a faulty device that 
includes many components (e.g., BP Deepwater 
Horizon oil spill in Louisiana). 

8.10 As we explain in §21 on adverse selec- 
tion, market failures reduce the overall effi- 
ciency of the economy which is why the govern- 
ment seeks to reduce their impact. 

8.11 cf. Chronology of treaties and original 
texts. 

8.12 The former is a potential importer of agri- 
cultural products and is thus reluctant to in- 
stitute the CAP. France and its ally Nether- 
lands, are on the other hand potential exporters 
willing to match the opening of industrial sec- 
tors that favors Germany with the agricultural 



ones. This tense negotiation also cuts back the 
commission's power leaving the council free to 
decide on all support mechanisms. 

8.13 The votes of the member states are 
weighted in principle according to the popu- 
lation size of the individual EU countries but 
the small and medium-sized countries have a 
larger than proportional number of votes to 
safeguard their own interests. 

8.14 The "de minimis" notice on agreements 
of minor importance which do not appreciably 
restrict competition under Article 81(1) of the 
Treaty establishing the European Community, 
OJC 371, 9.12.1997. 

8.15 We use the 2002 consolidated version of 
the 1957 treaty, now known as the EC Treaty, 
where article numbers are changed with re- 
spect to the original. 

8.16 The Sherman Act was also used to prose- 
cute labor unions who monopolized industries. 

8.17 It is frequently applied by the lower fed- 
eral courts to prosecute labor unions who mo- 
nopolized employment in whole industries. 

8.18 Some agreements or behavior are so likely 
to harm competition and so unlikely to produce 
pro-competitive benefits that they do not war- 
rant the time and expense required for an in- 
quiry into their effects. 

8.19 For instance, setting a price range, a com- 
mon list of prices for discounts, agreeing on dis- 
counts or on terms and conditions of sale, lim- 
iting supply and even agreeing on a common 
standard are unlawful. 

9.1 As we explained in §8.3, cartels were re- 
ferred to as trust in the US which in turn ex- 
plains why the struggle against trusts by public 
authorities lead to the now accepted terminol- 
ogy of anti-trust . 



9.2 To simplify, we disregard the issue of fixed 
cost. 

9.3 The ceiling allocations that cover short pe- 
riods are averaged to match the data on yearly 
production. 

9.4 Iraq is not included in the data due to the 
eventful destiny of this country; notice also that 
Indonesia is now a chronic importer of oil. 

9.5 There are three basic levels of licensing con- 
trolled by the government. Under registration, 
one must solely register identity and some per- 
sonal information (possibly for some fee), certi- 
fication requires an examination for minimum 
skills and knowledge and finally licensure is a 
harder and better protected exam for the right 
to practice. 

9.6 The EC nevertheless calls a cartel a group 
of firms participating in a collusive agreement. 

9.7 A related clause is the most-favored-nation 
provision whereby an (international) trading 
partner is promised that his import tariffs will 
be no higher than those offered to other trading 
partners (in the past or future). 

9.8 This can be explained by quality uncer- 
tainty (cf. the lemons problem in §21.1.1), 
transaction costs to set up the resale or low 
residual value if already consumed. 

9.9 The interaction is finite because durable 
goods have a finite live span and MFC or MOR 
clauses do not apply to next generation prod- 
ucts. 

9.10 The last period being used to retaliate, it 
necessarily brings a competitive outcome; this 
is why collusion cannot last until the end. 

9.11 Indirect ways include fixing the distri- 
bution margin, conditioning promotional cost 
reimbursement on price fixing observance or 
threatening to delay or suspend deliveries. 



10.1 Introduction of many variants of a ba- 
sic product like automobile or breakfast cereals. 
Still if variants can be withdrawn cheaply there 
will be entry because nobody believes that the 
incumbent will keep so many substitutes to its 
own base line (cf. §11.2.1). 

10.2 This legal term means to hinder, deter, im- 
pede or handicap. It originally meant the legal 
proceeding initiated by a creditor to repossess 
the collateral of a defaulting borrower. 

10.3 In some formulations the postulate says 
that "the entrant believes the incumbent won't 
react". Such a belief is coherent only if experi- 
ence has shown that incumbents did not react 
to entry (otherwise the entrant would be irra- 
tional). The postulate is thus about the incum- 
bent's behavior and not about the perception of 
others. 

10.4 There might be some technical impedi- 
ments to immediate adaptation but if the in- 
cumbent wants to change his plans, he will be 
able to do so in a near future and since the rele- 
vant profit is the discounted sum of future prof- 
its, the change will be reflected. Thus, we might 
as well assume that marketing changes are in- 
stantaneously effective. 

10.5 Their result is been extended Segal and 
Whinston (2000) by for price discrimination 
and sequential bargaining. 

10.6 Since damages for breach of contract tend 
to be per-se illegal (cf. Posner (2001) p.232), the 
reader must keep in mind that we are dealing 
with take-or-pay contracts. 

10.7 We have pf = v at c = C2 and c\ increasing 
with x. 

10.8 Formally, there is indeterminacy for the 
captive and free prices upon entry since we 
have only one zero-profit condition. Yet, if 
we adopt a minimum price discrimination rule, 



then the captive price remains at x the value it 
must achieve when c = c\. 

10.9 The fact that there are at least two such 
firm competing to get the exclusivity guaran- 
tees that whoever wins, he gets no rents; hence 
the upstream firm reaps the whole monopoly 
profit. 

10.10 The symmetric equilibrium solves q* = 
RUn-Dq*). 

10.11 In EU law it is called Open Network Pro- 
vision. 

10.12 Dupuit (1852), citing older French canal 
legislation passed in a time of "laissez-faire", is 
a forerunner of this position except for railways 
out of security concerns. Oddly enough, France 
now stands in complete opposition to this liber- 
alizing practice. 

10.13 Where prices are set below average to- 
tal (but above variable) costs, some additional 
elements proving the predator's intention need 
to be established in order to qualify them as 
predatory, given that other commercial consid- 
erations, like a need to clear stocks, may lie at 
the heart of the pricing policy. 

10.14 Some countries have non-economical 
motives for industrial policy such as national 
independence, technological autonomy, support 
for declining traditional activities or sustaining 
geographical and political cohesion. 

10.15 Imitative entry occurs through manage- 
rial and labor turnover (former employee start- 
ing their own business). Knowledge is non-rival 
and often non-excludable which leads to posi- 
tive technical externalities and increasing re- 
turns. Furthermore, new knowledge tends in 
the aggregate to complement existing knowl- 
edge. 

11.1 The only exception to this rule has to do 
with quality differentiation; one can indeed find 



door to door, a discounter, a standard supermar- 
ket and a delicatessen shop. 

11.2 Decades after Hotelling's pioneering work, 
d' Aspremont et al. (1979) exhibit a flaw within 
his original linear city model. Fortunately, 
Economides (1986) shows that the flaw disap- 
pears as soon as they are several geographical 
dimensions which in our basic story amounts to 
add North and South as possible locations for 
firms. 

11.3 If consumers are not uniformly dis- 
tributed over the street but according to the cu- 
mulative distribution function F{x) then profit 
is 7r a = PaF(x) so that the FOC of best reply is 



Pa fix) 
2{b-a) 



since x solves pb~Pa = [a+b-2x)[b- 



F{x) 

a). Using the symmetrical formula l-F(x) = 
2(i,la) > we derive the indifferent consumer as 
an implicit function of the location midpoint 

a+b _ -._ l-2F(x) 
2 X f(x) ■ 

11.4 Imagine, after Jean Francois Mertens, 
taking the [0; 1] segment and bending it so as to 
join the extreme 1 and values to form a ring. 

11.5 The proper analysis requires to consider 
asymmetric locations, work out demands, price 
best responses, equilibrium and compute prof- 
its as a function of neighboors locations. One 
can then proceed to show that a firm fine tunes 
her location to be just in between her two neigh- 
bors. In equilibrium of this process symmetric 
locations obtain. 

11.6 cf. regional economics course 

11.7 It can be assumed competitive because 
there are many individual buyers in the city 
and many supplying farmers around. 

11.8 Although land cultivation has decreasing 
returns to scale, we assume constant returns to 
scale for family production. 

11.9 All the economic surplus of cultivation 



goes to landlords which is why it has been 
called a rent by economists. 

11.10 If demand for good i increases, so does 
its price; the bidding curve then rises and the 
share of land dedicated to this crop increases to 
satisfy demand. 

11.11 If it was not so, unhappy places would be 
vacated, forcing down the local housing price. 

11.12 Formally, -P- = =r < by the envelope the- 

UX Cf x 

or em. 

11.13 Suppose the price falls so much that 
she may maintain her initial bundle of con- 
sumption and therefore her initial utility. She 
would then consume more housing as it is now 
cheaper (cf. equality of MRS to relative price in 
§2.2.1) and be happier, but this cannot happen, 
so that the price must increase. Meanwhile this 
process takes place, she reduces her housing 
but never back to what she could afford initially 
so that in the end, her dwelling is larger (use a 
graphic to check this). 

11.14 As w increases, q x rises, thus the slope of 
the bidding curve -f- also rises. 

11.15 Executives refer to it as cannibalization; 
it typically occurs when a new model is intro- 
duced while the old one is still for sale although 
they are priced differently. 

11.16 With m attributes normalized on a [0;1] 
scale, the opportunity cost is £,-< OT ttixt - Z{) 2 . 
Yet, our ability to aggregate complex informa- 
tion is limited by our cognitive power to the 
point that we value each lower rank attribute 
twice less than the previous one and so on; 
this implies that the most important attribute 
weights as much as all the rest and we may 
safely continue with two dimensions, a strong 
and a weak one. 

11.17 Either the representative agent values 
diversity through the composite good or y is the 



aggregate utility for different people consuming 
different brands. Authors also assume unit in- 
come elasticity to avoid income distribution is- 
sues in their general equilibrium model. 

11.18 A lawyer who can type very fast will nev- 
ertheless let his secretary do that job to concen- 
trate on legal studies where his skills are best 
rewarded. 

11.19 On the one hand, he downsizes this 
model by performing a partial equilibrium 
analysis with a = 1 but on the other hand, he al- 
lows for decreasing demand elasticity, thereby 
making the optimal price increasing with out- 
put (like the competitive supply function of a 
DRS technology). Notice that this obscuring 
feature fails to add anything to his findings. 

11.20 In Krugman (1979)'s model with de- 
creasing demand elasticity, increasing labour 
endowment leads to more variety, higher wel- 
fare, smaller per-firm output, thus a lower price 
or equivalently a higher wage. If trade is pro- 
hibited but workers are mobile, they'll all go to 
the large country that already enjoys more va- 
riety and higher wages. Thus industrial con- 
centration might occur. 

11.21 The maximal quality achievable is 
bounded by the technological state of the art. 

11.22 For i = A,B, j denotes the other firm. 

11.23 Danone, an important food producer was 
in the 1980s just one yogurt brand of a multi- 
activities conglomerate without much public 
exposure. In 1994 it decided to change its 
name to Danone and concentrate on quality 
food. Sales growth have been accompanied by 
much advertising. 

11.24 Nestle which was already the world's 
largest and most diversified food company in 
2000 is pursuing in 2003 a clear strategy of con- 
centrating on high-end food production by di- 



vesting non-core business and buying famous 
brands in the US where its market share is still 
much lower than in Europe. 

11.25 cf. Brandz report on firms with highest 
levels of goodwill. 

11.26 Since profit is concave in price, the sign 
of^isthatofgL. 

12.1 Among other things, it also contributed to 
stop whaling that was formerly a prime source 
of oil. 

12.2 The figure expressed in year 2000 dollars 
uses a capitalization rate of 11% representative 
of the industry's profitability. Such a discount- 
ing is made necessary by the fact that the test- 
ing phases generally last more than a decade. 

12.3 A reinforced cooperation is called a Con- 
sortium; it is made by a group of independent 
companies working together for the fulfillment 
of a specific project like a building or a motor- 
way which requires close cooperation between 
engineering, planning and construction. A still 
greater form of cooperation is the joint venture. 

12.4 Reverse engineering is the process of ex- 
tracting knowledge from a human-made arti- 
fact i.e., starting with the known product and 
working backwards to divine the process which 
aided in its development or manufacture. In- 
ternational treaties on the protection of trade 
secrets are silent with respect to it but the soft- 
ware industry has recently succeeded to obtain 
protection in the US and Europe to fight piracy. 

12.5 The iPod-iTunes bundle also paved the 
way to the legal selling of digital music over the 
internet. 

12.6 Ideally, the patent office should compute 
the duration that enables the developer to re- 
coup exactly his R&D costs but this would put 
the office under the pressure of lobbies (in- 
dustry, consumers and government) with the 



likely outcome that incumbent wealthy indus- 
tries would get lengthier patents than the inno- 
vative and fragile industries which most need it 
(from a social point of view); hence it is more ef- 
ficient in the long run to have the same fixed 
duration for all inventions whatever the sector 
they come from, even-though this may appear 
sub-optimal at first sight. 

12.7 Aspirin is one of the few exceptions; it was 
invented more than a century ago and is still 
marketed as an original product by its patent 
holder Bayer. Notice that the name remains a 
trademark of Bayer. 

12.8 Recall from §6.1 that there is excessive 
entry in most oligopoly models. 

12.9 Technically, the profit at the optimum 
is Tlf satisfying ^k. = -D(pf) by the en- 
velope theorem. Integrating, we get V M = 
fcD{pM) dx < V* since pf > c. 

12.10 Technically, we have D(c) < D{c) over [c;c] 
hence V B = f°D(c)dp< / c a D{p)dp = V* . 

12.11 The incumbent in an oligopoly with n > 2 
prefers ad-valorem royalties only if she can ne- 
gotiate with the entire industry. If she has to 
negotiate separately with each firm, she prefers 
per-unit royalties. 

12.12 Notice that 8 > 6 => m < 1 which is why 
we refer to such a situation as a drastic inno- 
vation since it allows a sole firm enjoying the 
innovation to exclude all obsolete ones. 



12.13 More specifically, m F <non\ 



2(m+l) 



and 



m F < mo n<2m-2. Recall also that we are ig- 
noring the integer problem i.e., the fact that m 
is an integer. 

12.14 This is only correct for m < n since other- 
wise auction and fee are equivalent. 

12.15 Unless they are subsidized but this will 
not happen. 



12.16 Although the exact number of licenses 
is indeterminate, it is never n because doing so 
reduces the auction to a fee setting which we 
know to be dominated. 

12.17 The Fronde revolt against the French 
king in the XVII th century was in part trig- 
gered by his selling additional offices and li- 
censes which enraged the current holders of 
these prerogatives. 

12.18 The same problem occurs for art print- 
ing. The printer of a lithography is supposed to 
break the copper mold of the original once the 
scheduled number of copies have been printed. 

12.19 In settings where the upstream firm sells 
a conventional input, she can build commit- 
ment by foreclosure (cf. §10.3) or vertical re- 
straints (cf. §9.2.2). 

12.20 If after entry, firms compete a la Ber- 
trand instead of Cournot, the condition reads 
nf + nf < nf where Ylf = and nf = V B . It is 
satisfied because V B is the monopolist's profit 
when pricing at c which is not the monopoly 
price. 

12.21 If there is rivalry among at two chal- 
lengers, a model of contest or conflict is needed 
to assess how much they would spend (cf. §7). 

12.22 Gilbert and Newbery (1982) consider a 
time-continuous framework with discount fac- 
tor r starting today (date 0). If she innovates 
first at date t, the challenger earns, in present 
value, V e [t) = f^°U^e~ rt dt while the incum- 
bent earns V\t) = J f nf e~ rt dt + / f °° n ? e~ rt dt. If 
he innovates first at date t, the incumbent 
earns, in present value, V (t) = J^\\fe~ rt dt + 
f^°U^ 1 e~ rt dt. Hence the values for innovation 
are V 1 = v'^-V^t) = / f °°(nf -nf] e~ rt dt and 
V e =V e {t) for the equilibrium date t. The differ- 
ence v l -v e = / t °°(nf -Ilf-n^e- rt dt is thus 
the integral from t to infinity of the instanta- 



neous difference in producer surplus between 
monopoly and duopoly. 

12.23 Management methods such as profit 
sharing and bonuses linked to the attainment 
of concrete objectives also display DRS since 
they force employees to bear risk; they thus be- 
come increasingly more expensive as their scale 
and breath increase inside the firm (cf. ch.20 on 
Moral Hazard). 

12.24 As innovation makes firm A stronger, 

dz* 

it forces competitors to retreat i.e., ^- < 
whether z is price or quantity. Now recall that 
j^>0 (cf. eq. 5.16) while §g < (cf. eq. 5.2), 
thus the combined sign of the strategic effect. 

12.25 This is done to simplify the exposition 
with the purpose of presenting the case oppo- 
site to Bertrand where there cannot be any 
market sharing. 

12.26 The specific knowledge on a product or 
production process, often obtained through ex- 
tensive and costly R&D. It is deemed to be 
a body of technical information that is secret, 
substantial and well identified (EC definition). 

12.27 The term originally denoted royal de- 
crees granting exclusive rights to certain indi- 
viduals or businesses, much as a charter, the 
ancestor of the corporation, bestowed rights to 
some institutions. 

12.28 There are older cases but they are more 
akin to privileges. In the Greek city of Sybaris 
in Sicily, 500BC, the king would grant a one 
year exclusivity to chefs who had invented a 
new recipe. Concessions for underground ex- 
ploration and mining were granted for a lim- 
ited period of time without ownership on the 
land itself. Privileges consisting in 5 to 20 years 
of exclusivity were awarded by cities to for- 
eign craftsmen accepting to settle there to per- 
form their art and to train local workers (e.g., 



Brunelleschi patent for marble transport on the 
Arno river, granted by Florence in 1426.) 

12.29 The Uruguay Round Agreements on Mul- 
tilateral Trade Negotiations of 1994 has set the 
duration to 20 years (it was previously 17 years 
in the US). All WTO member countries apply 
this rule. 

12.30 US patent #4,873,662 for "an informa- 
tion handling system and terminal apparatus 
therefor" was filed in 1980 and awarded in 
1989. In 2000, the owner British Telecom, sued 
some large US internet access providers claim- 
ing that its patent covered web browsing and 
hyperlinking. The claim was demised by courts 
in 2002 given the evidence of older hypertext 
projects. 

12.31 US patent #4,698,672 "for reducing the 
quantity of data in image coding" was awarded 
in 1989. The current owner Forgent Networks 
sued large computer makers in 2004 for al- 
legdly using the patent in their JPEG graph- 
ical format. Similarly, in 1996, Unisys and 
CompuServe began seeking royalties on their 
patented LZW compression method, a key el- 
ement in the GIF graphic file format; it was 
quickly replaced by the free PNG. 

12.32 A similar lawsuit against Daimler- 
Chrysler was dismissed late 2004. 

12.33 US patent #6,460,020 for an interna- 
tional transaction system over the internet cov- 
ering catalogue, currencies, taxes, shipping cost 
and addresses was filed in 1997 and awarded 
in 2002 to a start-up DE Technologies who sued 
the computer manufacturer DELL in late 2004. 

12.34 US patent #5,960,411 was filed in 1997 
and awarded in 1999 to Amazon for 1-click pur- 
chase described as "a method and system for 
placing an order to purchase an item via the 
Internet". Once the buyer has registered him- 



self with the seller, the later sends a "cookie" 
to the buyer's computer to ease future transac- 
tions. The idea is identical to opening a bank 
account, obtaining a credit card and using the 
later in a shop afterwards. Amazon sued im- 
mediately Barnes & Nobles for devising a com- 
puter code implementing the very same idea; 
the case was settled privately. 

12.35 US patent #6,782,370 covers a "System 
and Method for Providing Recommendation of 
Goods or Services Based on Recorded Purchas- 
ing History" was filed in 1997 and awarded in 
2004 to Cendant Corporation, a hotel and car 
rental company, that immediately sued Ama- 
zon for its practice of recommending books to 
viewers. 

12.36 The original convention of 1886 was 
changed into the Universal Copyright Conven- 
tion in 1952 at Geneva and revised in 1971 at 
Paris. 

12.37 The author agrees with US federal court 
judge Posner that the sheer majority of works 
which have a very low value should be made 
freely available to constitute the base of new 
creations. The very fact that a formal (even 
free) agreement must be obtained from the 
copyright holder to use his work creates trans- 
action costs (cf. §13.3.3) thwarting the creation 
process. He also remarks that "adding 20 years 
to the already long 50 years copyright term has 
only negligible effects on the incentive to cre- 
ate a copyrighted work in the first place. Who 
would refuse to create a work unless he could 
count on his heirs' receiving income from it 
more than 50 years after his death?" 

12.38 Note: a "brat" is spoiled child. As ex- 
plained by Duncum (2007), the reason why 
Bratz captured a large market share within 
a few years of its introduction (unlike previ- 



ous attempts over the 50 years reign of bar- 
bie) is "age compression", the fact that modern 
girls become preoccupied with adult life sooner 
and thus leave traditional dolls sooner (aka 
Kids Getting Older Younger). The more trendy 
Bratz, then, offers a bridge between childhood 
and adolescence (cf. also the staggering success 
of the Titeuf comic (1M books/year) geared at 
the same age category). 

12.39 The jury did not award any punitive 
damages and found that MGA acted willfully 
when employing the designer. 

12.40 The argument of lower quality some- 
times used to bar import from poor countries 
to richer ones does not apply here since a US- 
made pair of Levis is THE genuine product. 

12.41 Region codes correspond to the follow- 
ing areas of the globe: 1 for Canada and USA, 
2 for Japan, Middle East, South Africa, West- 
ern Europe, 3 for East Asia, Southeast Asia, 4 
for Pacific and other americans countries, 5 for 
Africa, Eastern Europe, India, Mongolia, North 
Korea, 6 for China and 8 for airplanes, and 
cruise ships. 

12.42 The difficult coding of the singer's 
voice was first tested on Suzanne Vega's song 
tom's diner, encoded here with Advanced Audio 
Coding at 20 kbits/sec i.e., 1112 th of the space 
needed on a CD. 

12.43 In a memorable quote in front of the US 
congress, the head of the MPAA said, "the VCR 
is to the American film producer and the Amer- 
ican public as the Boston Strangler is to the 
woman home alone." 

13.1 Our data is limited to the US and France 
as Eurostat does not yet compile exhaustively 
this information. The VA percentages are ob- 
tained as the ratio of value added (inside firms) 
to gross output (market sales of final products). 



The sectors differ in the two tables for statisti- 
cal institutes have a differing focus of analysis. 
The sectors weights, as % of GDP, are reported 
in table 1.1. 

13.2 Incorporation is the legal process through 
which a company receives a charter from the 
State that allows it to operate as a corporation. 
The articles of incorporation describe the regu- 
lations that govern the running of the company, 
they include the main business and purposes 
of the company, the shareholder's voting rights, 
the directors duties, the general working and 
management practices. 

13.3 This is called a single tier system; there 
also exists a two-tier system where stockhold- 
ers elect a supervisory board beside the admin- 
istrative one. 

13.4 Basing compensation on performance is 
one frequent form of contest (cf. §7). 

13.5 Notice that once docility is present, soci- 
ety may exploit it by teaching values that are 
truly altruistic; that is, which contribute to the 
society's fitness, but not to the individual's (like 
going to war singing). 

13.6 In not-for-profit organizations like pub- 
lic administrations, the authority structure and 
the prestige associated to high levels make the 
influence cost inefficiency particularly binding. 
The rational response is thus bureaucracy, a set 
of rigid rules that sometimes do not make sense 
at the individual level but overall protect the 
institution from abuses. 

13.7 This meritocratic conception is directly 
inspired by the Chinese mandarin system, dat- 
ing from the 7 th century whereby public service 
positions were filled through written examina- 
tion open to all males. This scheme, as well as 
knowledge of the Chinese political and econom- 
ical system was brought to Europe by Jesuits 



during the 17 f/j century and strongly influenced 
the enlightenment thinkers. Adoption occurred 
during the 18 f;! in France (early through the 
revolution), England and the US. 

13.8 It must be noted that during the middle 
ages, official positions were few compared with 
the aristocratic population (itself very small). 
To fill in the positions awarded by rulers, fam- 
ilies somehow competed with their most able 
siblings thereby guaranteeing a minimal mer- 
itocracy. As the State grew in size and became 
involved with technical activities (war machin- 
ery, boats, roads, canals), adequate education 
became indispensable; it lead to the creation of 
specialized graduate schools. 

13.9 It is customary to refer to functions as 
units and thus to the organization as the "U- 
form". The organization around products is 
said to be multidivisional, so that one speaks 
ofthe"M-form". 

13.10 A likewise dichotomy seem to apply to 
China vs. USSR when seen as large centralized 
economic producers. Whereas the defunct So- 
viet economy consisted of dozens of specialized 
ministries, China comprises a large number of 
reasonably self-sufficient regions. 

13.11 For instance, Chakravarty and MacLeod 
(2009) analyze the case of building contracts in 
the US and show they are based on a standard 
template with only a few terms modified to deal 
with special circumstances. 

13.12 There exists a theory of complete con- 
tracts stemming from the even more abstract 
implementation theory which proves that com- 
plex contracts can achieve almost anything in 
ideal environments (including the first-best). 
As soon as one of the previous issues start 
to bite, complete contracts becomes less useful 
and may be dominated by simpler ones. 



13.13 Many specific bargains are cheap to 
strike, but since most relationships involve a 
large number of details, the small costs can add 
up to a substantial level if we are to negotiate 
on everything. 

13.14 Like all in behavioral works, the theory 
is fairly simple since there is no strategic inter- 
action, only external social forces that push in- 
dividual in a certain direction; the force of this 
approach lies in its empirical validation that 
purely theoretical works lack. 

13.15 Interestingly, the norm is not imposed 
by society upon the parties but endogenously 
designed so as to fit their objective. 

13.16 Notice that such a scheme requires a 
common cultural understanding in order for the 
chosen norm to have the same meaning to each 
party. 

13.17 Gneezy and Rustichini (2000) explain 
how parents are usually ashamed when arriv- 
ing late to pick-up their children at school be- 
cause the master has a duty to stay (without 
extra pay). Once fines for latecomers were in- 
troduced, many parents started arriving ex- 
tremely late, happily paying the price; they had 
turned the school into a rather inexpensive day- 
care center. 

13.18 Mathematically, the joint payoff is a con- 
vex function of the degree of complexity so that 
the maximum is achieved at a corner. 

13.19 Most people take pride in doing the right 
thing and do not suffer from the extra effort it 
may require wrt. the legal minimum. 

13.20 It may even be an informal contract en- 
forced by social pressure. 

13.21 The tendency to believe that one brings 
special skills to the relationship, or has in- 
vested into human capital specific to it and de- 
serves to be rewarded for these things. 



13.22 The empirical analysis also uncovers a 
new behavioral force: ex-ante competition legit- 
imizes the terms of the contract, and aggrieve- 
ment occurs mainly about outcomes within the 
contract and not about the contract itself. 

13.23 Of course, if the vehicle hovered over 
Africa, rural China or India, the green ar- 
eas would be much smaller, and there would 
be large spaces inhabited by the little black 
dots we know as families and villages. But 
the red lines would be fainter and sparser in 
this case, too, because the black dots would be 
close to self-sufficiency, and only partially im- 
mersed in markets. This implies that the study 
of rural economies cannot be performed with 
the standard framework that was developed by 
economists of the industrialized countries to re- 
spond to the preoccupations of these specific so- 
cieties. 

13.24 The changes at this level are either 
peaceful and slow like the EU construction 
or occur suddenly through revolutions (e.g., 
French Republic or USSR) or in the aftermath 
of wars (e.g., US independence or post 1945 
Japan). 

13.25 The primary purpose of institutions is 
to reduce uncertainty by establishing a sta- 
ble framework (but not necessarily efficient) for 
exchange. The ability of a society to main- 
tain social peace and foster economic perfor- 
mance is now widely linked to the choice of its 
institutions such as political regime, customs, 
laws and economic regulations (cf. North et al. 
(2007, 2006) for a synthetic introduction). 

13.26 If it was possible to write contracts cover- 
ing every meaningful future event then no lit- 
igation would ever occur. Indeed, if one party 
tries to act opportunistically, the other party 
calls on to the judge; the latter is able to ver- 



ify what is happening, look at what the contract 
says and forces the wrongful party to perform. 
Anticipating, this deterministic course of event, 
each party is compelled to respect every single 
aspect of the contract. As a result, courts are 
inactive. 

13.27 If the managerial effort e (counted in €'s 
of opportunity cost) participates to decrease the 
cost C(q, e) then the manager-owner will expand 
the effort e* solving |j = -1 which is the effi- 
cient effort at the society level. 

13.28 For instance, money (coins, bill, bank 
check, credit card) eliminates the transaction 
cost of barter which requires a double coinci- 
dence of wants. 

13.29 We deliberately use the vocabulary of in- 
centives and agency as it is the theory that will 
be used to shed light on this issue in the next 
chapter. 

13.30 Nickname of perquisite, an extra al- 
lowance or privilege enjoyed as a result of one's 
position. 

13.31 This would be the case if the parties 
can use a rigid mechanism to share any ben- 
efit since they will be unable to influence the 
division. 

13.32 This author thought that negotiation 
would resolve efficiently this issue but failed to 
understand the importance of sequentiality and 
commitment. 

13.33 The human capital of employees is not 
transferable as slavery is not an option. How- 
ever employment contracts customarily include 
a clause specifying that the output of intellec- 
tual activity like software code or industrial 
designs remains the exclusive property of the 
firm. 

13.34 Leontief (1946) compares these two 
schemes and elegantly demonstrates that the 



price-only negotiation is akin to standard 
monopoly pricing, thus inefficient, while the 
price-quantity negotiation is akin to a perfect 
discrimination, thus efficient (cf. §4.1.3). 

14.1 For instance, most companies contract of- 
fice cleaning to specialized firms. 

14.2 Marshall (1890) agreed with the collec- 
tive rationality argument but claimed that the 
price mechanism would avoid indeterminacy 
even though he failed to explain clearly how 
and why. 

14.3 The error is probably due to the obsession 
with the market price mechanism that is still 
widespread in the economics profession. 

14.4 Reversing the roles amounts to let the 
buyer set the price and then allow the seller to 
choose the quantity. The seller thus maximizes 
pq-C{q) and demands the competitive quantity 
s{p) solving p = C m . The buyer then maximizes 
R{s[p)) - ps{p) = R(q) - qC m {q) using the change 
of variable q = s{p) o p = C m {q). The FOC is 
thus §f = §§ + <?§^->§§ since cost is convex; the 
buyer then selects a quantity lesser than the 
optimal one q. 

14.5 When n s = F-C{q), g- = -C k = g and 
likewise n B = R(q)-F implies that j^ = R k = -^. 

14.6 The original reason why S and B engage in 
a bilateral relationship is because they expect 
the complex design of the item to be worthwhile 
i.e., the surplus 8 is greater than the sum of de- 
fault profits (cf. footnote 26.3 for a graphical 
illustration). 

14.7 Notice that any proportion A e]0;l[ would 
support the results to come. 

14.8 Alternatively, one party deposit a financial 
hostage that does not produce interest; the re- 
lease occurs when the trade is agreed so that 
the party becomes impatient and loses most 
bargaining power. 



14.9 Thanks to their close relation each can ob- 
serve the action chosen by its partner but not 
a court as would be required to enforce a con- 
tract stipulating a specific level. More on this 
in §20.1. 

14.10 The proper theory of incentives is the ob- 
ject of ch. 20. 

14.11 A unit boss may be biased toward 
its workforce because sustained contact with 
workers fosters friendship and empathy. Hence 
she cares about the satisfaction of her subordi- 
nates. 

14.12 The CEO of a large corporation has a 
wage linked to performance but he hardly owns 
more than 1% of its firm's stock. 

14.13 Some targeted investments can be writ- 
ten into a contract, but human resources dedi- 
cated at learning one's partner needs or getting 
more flexible, will always escape control or au- 
dit at a reasonable cost. 

14.14 We only consider this case which is the 
most frequent to avoid adding another design 
whose symmetry would not yield any significa- 
tive additional insight. 

14.15 We use the principal-agent terminology 
of part H. 

14.16 The architect of the Sydney Opera House 
did not foresee the need for air conditioning. 
Since this omission was discovered lately and 
standard noisy units were out of the question, 
an expensive ice hockey floor was installed as a 
means of cooling the building, (borrowed from 
Preston McAfee). 

14.17 We mean that he will rationally take the 
opportunity to advance his objective, profit. No 
moral connotations is involved. 

14.18 We assume F e < V in order that continua- 
tion of the project be always the efficient action. 



14.19 For a convex design cost d with |^ > 0, we 
still get T a > t a while the comparison n FP > n c + 
involves d(T a )- d{f a )- F(r a -f a ) + (l-T a )a{V - F) 
which, using super modularity, is shown to in- 
crease with a. 

14.20 This feature of modern corporations is at 
the root of some large bankruptcies (or public 
rescues when "too big to fall"). 

14.21 Regulation is optimal provided than the 
social damage is large enough so that the sav- 
ings on damages overcome the loss on safe firms 
forced to adopt the standard. Alternatively, it is 
enough to assume that risky firms are numer- 
ous enough. 

14.22 Contracting upon the input K is obvi- 
ously even more difficult (i.e., costly) than us- 
ing output. Since q and K are in a one-to-one 
relationship, it would be a dominated strategy. 
Introducing uncertainty as we do in §20.2 could 
make it a worthwhile consideration. 

14.23 The model makes sense only if the cost 
function c{.) is convex, includes a fixed cost and 
K <K*. 

14.24 Our setting is formally equivalent to 
short term vs. long term cost when considering 
capital as a flexible or inflexible factor. 

15.1 The EU Treaty has no provision for 
merger so that the Court of Justice judged in 
1973 that there is abuse of a dominant position 
if an undertaking already holding such a posi- 
tion strengthens it by acquiring a competitor. 

15.2 Note though that some very large firms 
remain in private hands i.e., away from regula- 
tors and stock markets. 

15.3 We count the percentage change between 
the first two years and the last two years. 

15.4 Forbes' ranking gives equal weight to 
rankings on sales, profits, assets and market 



capitalization. We use the same method as in 
the above footnote and compute % up and down 
before taking the top tiers for each. 

15.5 Their ranking is solely based on revenue. 

15.6 Excluding firms with less than three years 
in the list; small firms have less than 20 thou- 
sand employees. 

15.7 Contrary to our standard notation, sub- 
scripts refer here to the number of active firms 
not the label of a particular one. 

15.8 This can be achieved by linking their wage 
bonus to their brand's profit and not to the par- 
ent company overall profit. 

15.9 They even compete in the world rally 
championship (WRC) for image building. 

15.10 A conglomerate holding k brands out of a 
total n - 1 finds it profitable to introduce a new 
brand if kn n -i < (k+ l)n n o k < 2+ " /n , roughly 
meaning that it should not control more than 
half of the brands, which is true in the Euro- 
pean car markets as can be seen in Table 15.18 
which contains 18 firms to which we can add 3 
brands from VW, 2 from Fiat, 2 from Daimler 
and 1 from PSA. 

15.11 The intuition lies in the fact that individ- 
ual profit is convex increasing in output, thus 
convex decreasing in marginal cost. This means 
that every firm is a risk-lover when it comes 
to draw a technology from a distribution with 
fixed mean (cf. Fevrier and Linnemer (2004) for 
a generalization). 

15.12 It turns out that in our simple model, 
production reshuffling and knowledge diffusion 
are identical. While the former is a produc- 
tive efficiency, the latter points at dynamic ef- 
ficiency. 

15.13 The first step to show that the combined 
post merger output <7i + <?2 is lesser than the sum 



<7i + q 2 of pre-merger outputs. We use two facts, 
firstly the best reply function b(.) in the Cournot 
model is decreasing at less than a one-to-one 
rate and secondly the merged firm reduces both 
of its plants' outputs below their unmerged best 
responses since it internalizes the externality 
that each plant's output has on its other plant: 
cji < Wqj + q-12) for i = 1,2 where q-u is the 
pre-merger aggregate output of the outsiders to 
the merger. Three cases appear: If qi < qt for 
i = 1,2 then the result is trivial. If qt > qi for 
i = 1,2 then the second observation implies qi < 
b{qj + q-i 2 ) < b{q~j + q-u) = qt for i = 1,2, a con- 
tradiction. Lastly, if q x < q x but q 2 > <?2, then by 
the first observation, qy + q 2 < c\\ + b{q x + q-u) < 
q\ + b[q\ + q-12) = c\\ + (J2, the desired conclusion. 
The second step uses again the first observation 
but applied this time to the rest of the indus- 
try to conclude that overall output has shrunk: 
Q = qi + q 2 + b(qi + q 2 ) <qi + cj2 + Hq\ + qi) = Q- 

15.14 To ensure that all firms participate, we 
need only assume that the worst does. 

15.15 The "real" entropy is H = e Y ; in physics 
its measures the amount of "disorder" of a sys- 
tem. 

15.16 Under the approximation of §15.3.4, we 
have St = y -f for all i < n with 7 « 577 + ln(n) , thus 
H = y^T-isn i~ 2 -Y 2 n x \ which computed in Ta- 
ble 15.13 for a sample of market sizes. 

15.17 Sales data, in thousands, come from the 
European Automobile Manufacturers Associa- 
tion. Notice that the name correspond to the 
industrial group; it may contain several brands. 
For instance, Rover and Land-Rover belonged 
to BMW in 1992; the former went independent 
again while the latter is now a Ford brand. 
Volvo is now also a Ford brand. 

15.18 It is called survival function when X des- 
ignates a time. 



15.19 A random variable with zero mean and 
unit variance. In the simplest case of the ran- 
dom e t = ±1 with equal probability while in 
the continuous time notation, the temporal evo- 
lution is modeled by geometric Brownian mo- 
tion: 4jL = [idt + adw where /i is the systematic 
component (trend) and d w the increment of a 
Wiener process. 

16.1 We adapt Boadway (1997)'s panorama as 
well as the terminology of public economics to 
our industrial organization perspective. 

16.2 Smith (1776) emphasizes the role of the 
State for some public works and institutions, 
without substantiating much the eligibility cri- 
teria. 

16.3 Musgrave (1957) originally refers to 
"merit goods" but the literature also uses la- 
bels such "mixed goods", "quasi-private goods" 
or "specific egalitarianism". Consumptions 
deemed immoral (e.g., gambling, prostitution) 
or harmful (e.g., drugs, tobacco, junk food) are 
demerit goods whose treatment is opposite. 

16.4 Again, the criteria for defining these ex- 
press value judgments at society's level. 

16.5 More precisely, Samuelson (1954) use two 
characteristics: absence of rivalry in consump- 
tion and the (practical) impossibility of exclu- 
sion. 

16.6 Conversely, it would appear that for de- 
veloping countries, every dollar spend on the 
rule of law is worth the while; the problem is 
whether the political elite think likewise. 

16.7 Interestingly, the century long experience 
of the welfare state in the OECD shows that the 
key factors for its emergence are democratiza- 
tion, aging population, prosperity, openness to 
trade, and an homogenous culture. 

16.8 Our emphasis on the mostly developed 
member countries of the OECD is driven by 



the availability of historical statistics (cf. also 
Schuknecht and Tanzi (2005)). 

16.9 Social security funds is included on top 
of central, state and local governments rev- 
enue (cf. Government Finance Statistics). Since 
expenditure rise to 47.4% (resp. 46.6%) of GDP, 
deficit is around 2% of GDP. 

16.10 Core mission includes general services 
(legislative organs, central ministries), public 
order, defense and environment but without 
debt interest. Although economic affairs in- 
clude the regulation of industries, most of the 
amounts relate to support programmes, subsi- 
dies and public infrastructure spending. 

16.11 Defense, in advanced economies, serves 
geo-strategic interests rather then protection 
against invasion. Debt whose service is be- 
tween 2 and 4% of GDP represents a burden 
shifted onto future generations by current lead- 
ers to enable larger expenses on redistribution. 

16.12 If my neighbor patrols the neighborhood, 
he protects his house and mine at the same 
time. I have no incentive to reward him for the 
lost time and he cannot threaten not to protect 
my house since that would leave his own unpro- 
tected; thus, unless I am compelled by a sense 
of duty, I shall free ride on his effort. 

16.13 It is our opinion, from reading the lit- 
erature, that proponents of this view seek ev- 
ery possible model (even feeble ones) to justify 
the current role of the State in a manner that 
avoids value judgment so as not to reveal their 
left-wing agenda. 

16.14 As noted by Shleifer and Vishny (1994), 
the market socialism controversy, quite active 
during the first half of the XX f ' ! century, fell 
into the same trap. Discussing the pros and 
cons of an economy based on prices (market) 
or quantities (socialist command-and-control) 



does not make a lot of sense if one must as- 
sume that both a socialist or a capitalist gov- 
ernment would want to advance the welfare of 
its citizens given the overwhelming evidence to 
the contrary (cf. §16.2.4). 

16.15 Their interactions is however assumed 
to take place in a free country i.e., one where 
violence is not an option. 

16.16 This school of thought does not hide its 
right-wing political agenda towards shrinking 
the State. The sincerity is appreciated although 
it makes it hard to disentangle objective find- 
ings from mantras. 

16.17 Their analysis is for the US but seem to 
apply fairly well to other western participative 
democracies. 

16.18 This is a libertarian view of the "good" 
society inspired by the history of local commu- 
nities in the early days of the US. 

16.19 The creation of a common agricultural 
market in Europe responded to the need to 
establish compensatory measures to smooth 
out the initial large productivity differences 
between member states agricultural sectors. 
Since then, the program has remained, gener- 
ated perverse effects and appreciable welfare 
losses. However, little of that 50bn€ rent is dis- 
sipated in rent-seeking activities, thus this pro- 
gram is mostly redistributive although its fair- 
ness is more and more questioned. 

16.20 In the same fashion, Glaeser and Shleifer 
(2003) recall that Enron's collapse provoked a 
public uproar and a call for new accounting 
rules. The new law was later influenced by the 
lobbying of industries but it seems clear that 
no one provoked the Enron scandal as a mean 
to get a new favorable regulation. 

16.21 Since half of the global war effort is 
defensive, a small percentage of conquerors is 



enough to generate frequent conflicts. 

16.22 This phenomenon is more acute in Eu- 
rope because the demise of the Roman Em- 
pire left more political fragmentation than else- 
where. Indeed, when every territory is sur- 
rounded by enemies, rivalry is bound to be 
fiercer so that military edge acquires a greater 
value which in turn propels military innovation 
(cf. similarity with R&D patent races in §12). 
This also explains why Europe was able to col- 
onize the rest of the world once it . 

16.23 cf. Rabelais (1534)'s Coin is the sinews of 
war and Baumol (2004). 

16.24 Epithets often adjunct to the word State 
are Leviathan, rentier, autocratic, predatory, 
proprietary, grabbing or exploitative. 

16.25 As recalled by Glaeser and Shleifer 
(2003), the US at the times of the Gilded age 
came to be dominated by a small economic and 
political elite, operating the country in a man- 
ner akin to rent-planning. The progressive era 
that ensued with the empowering of the fed- 
eral (central) government can be understood as 
a restoration of public order and a leveling of 
the competitive playing field. 

16.26 For instance, Besley and Persson (2009) 
recall that income taxation was created in coun- 
tries on the verge of war. 

16.27 Notice that the revolutions in Eng- 
land, France and USA predating the advent 
of democracy were all triggered by conflict be- 
tween ruler and council w.r.t. a tax increase (as 
well as demands for political rights). 

16.28 Quite often, the ruler starts by prohibit- 
ing an activity (on moral grounds) in order to 
later sell selective exemptions. 

16.29 As recalled by Acemoglu and Robinson 
(2000), this goal was achieved by forcing the 



traditional ruling elite to accept more and more 
voting members from the bourgeoisie. 

16.30 Policies favoring exports and limiting im- 
ports practiced by all European countries. It 
lasted longer in France or Spain as compared 
to the Netherlands or England because parlia- 
ment in the latter countries stripped the ruler 
of his/her granting privileges sooner. 

16.31 According to Irwin (2010), this US doc- 
trine retaliated UK discriminatory trade prac- 
tices. Bairoch (1972) shows it has been prac- 
ticed by all advanced economies well into last 
century. 

16.32 Colonialism, which involves military oc- 
cupation or its threat (cf. gunboat diplomacy), 
is not motivated by rent-planning alone but has 
undoubtedly the effect of forcing foreign occu- 
pied markets to open to national industries; it 
basically negates the colonized the right to ex- 
ercise protectionism themselves. 

16.33 The UK, paragon of free trade, had many 
binding such restrictions. 

16.34 As we explain formally in §3.3.3, the 
progressive inclusion of the bourgeoisie into the 
elite (taking place sooner in the Netherlands 
and England) changes the State's attitude to- 
wards the economy from pro-monopoly to pro- 
competitive. 

16.35 Rent-planning is only sympathetic to 
drastic innovations that expand activity into 
new directions and require the establishment 
of new cliques. The improvement of an exist- 
ing process or the novel ability to substitute a 
good or service will be opposed by the clique 
currently in charge unless it can appropriates 
it. Thus, depending on the bargaining power 
of this particular clique within the entire elite, 
the innovation will be more or less retarded. 

16.36 As noted by Vargas Llosa in de Soto et al. 



(1986), "When legality is a privilege available 
only to those with political and economic power, 
those excluded — the poor — have no alternative 
but illegality". 

16.37 Pirenne (1914) recalls how guilds 
or workers opposed technological innovations 
with destruction of machines. One can like- 
wise interpret the slowdown of the US and UK 
economies in the 70s as the result of entrench- 
ment of pressure groups at the head of the rul- 
ing elite. 

16.38 Religious zeal (including the soviet brand 
of communism) motivated some advances in 
military technology but we dot count these as 
advancing human welfare. Fundamental re- 
search, on the other hand, being a public good 
is equally subsidized by all affluent states. 

16.39 We believe rent-planning to be more ac- 
curate for the developing world since the ruling 
elite leads the relationship and uses coercion, if 
not outright violence, to achieve its objectives. 

16.40 This is fully consistent with the commit- 
ment problem of the State who promised the 
private managers of the privatized firms that 
their markets would be protected forever. Once 
a new government steps in, it feels free to re- 
nege on its promises to cater to voters' interests. 

16.41 A recent and extreme example would be 
Spanish air control. 

16.42 Beyond psychology and moral senti- 
ments of justice, risk aversion plays a role since 
the return to owned assets is known whereas 
that of "would be" assets is, be definition, un- 
certain. 

16.43 The euractiv think-tank provides a com- 
parison of the two systems. According to a re- 
cent interview from the think-tank cfoeurope 
with financial officers of large firms, the lobby- 
ing in Brussels is on the rise. 



16.44 A bribe is not a welfare neutral transfer 
because a fraction is always lost in transaction 
costs. For instance, money laundering does not 
convert dirty dollars into clean ones on a one-to- 
one basis. There is here an exact analogy with 
taxes and the cost of public funds (cf. §17.1.2). 

16.45 It is itself subject to collusion or bid- 
rigging. 

16.46 The payment to the State is a transfer 
that bears no wealth effect if the winner is a 
firm, not an individual. 

16.47 He rescues from oblivion a French (and 
to a lesser extent German) practice brought 
to British attention by Chadwick (1859) (cf. 
§16.4.1 on concessions). 

16.48 In some countries, tenants are required 
to post a month or two of rent in a blocked bank 
account to guarantee that when they leave 
there is enough to rehabilitate the flat to its 
original state. 

16.49 Lately, upon change of majority in the 
town hall, some contracts have been inter- 
rupted and, on average, public provision is now 
cheaper. Since the private utilities benefit from 
scale economies and public provision is deemed 
productively less efficient by the same report, 
the previous stylized fact indicate that private 
providers enjoy a rent. 

16.50 In US parlance, privatization refers to 
the provision of a public service by a private 
(regulated) firm as opposed to in-house provi- 
sion by a public institution while in European 
parlance, it refers to the sale of a State Owned 
Enterprise (SOE) to private investors. 

16.51 Quite often, the ruler starts by prohibit- 
ing an activity (on moral grounds) in order to 
later sell selective exemptions. 

16.52 This is called Tax Farming. The max- 
imal incentives we refereed to above have the 



undesirable consequence, that the tax-farmer is 
lead to abuse his coercive power to extract more 
than the nominal tax rate or amount. 

16.53 His account regards France but seem to 
apply equally to the rest of Europe. 

16.54 This scheme is risky thus promotes pro- 
ductive efficiency yet, in the case of a service 
sold to the public it has a poor record for alloca- 
tive efficiency since monopoly pricing will take 
place. 

16.55 When the price is zero, one speaks of 
expropriation. Notice that the SOE is an old 
concept found in many civilizations such as 
Pharaonic Egypt or the Inca empire. 

16.56 Notice that these economic transactions 
do not belong to the standard we use in this 
book since one party enters unwillingly the ne- 
gotiation. 

16.57 Bel (2006) recalls that the very first pri- 
vatizations were undertaken by Nazi Germany 
during the 1930s. 

16.58 Rising unemployment triggers the cost 
of welfare benefits, social security programs 
are still being extended, pensions become more 
costly because of progress in longevity. 

16.59 This is what theory recommends but not 
necessarily what happens since there are cases 
where the regulator was quickly captured by 
the incumbent and did nothing to foster entry. 

16.60 Typically, the liabilities related to pen- 
sions or health care are taken over by the State 
which means that they will be paid by future 
generations and are not to be discounted from 
the receipts of the sale. 

16.61 Inspired by the private initiative of En- 
glishman Freddie Laker in 1977 (cf. Examples 
in §10.4). 

17.1 The validity of this analysis took long to 



be recognized since Marshall (1890) and Pareto 
(1906), to cite a few, opposed it. 

17.2 He mentions two railways companies op- 
erating between Paris and Versailles on the op- 
posite banks of the Seine river who ended-up 
merging quickly. 

17.3 He sees it essential to the community 
in the areas of national defense, geographical 
cohesion, political transparency and economic 
progress. 

17.4 "Competition between a limited num- 
ber of entrepreneurs is rationally nothing but 
a passing phase after which there is the defini- 
tive creation of a sole monopoly based on the 
ruin of the others, or a monopoly of all of them 
or of some of them in coalition." 

17.5 At the same time, luck often bestows 
someone with a monopoly position or consider- 
able augment the value of an asset. Profits so 
derived are then justly seen as windfall, but in 
the absence of a clearcut method to distinguish 
the two ways, we should permit the latter to 
embolden the former. 

17.6 To understand this position, it is useful 
to ponder the opposite polar case when, for in- 
stance, Apple charges preposterous prices for 
its gadgets to devotees and no one complains. 

17.7 An important element put forward by con- 
testability theory are sunk cost since in §6.1.7 
it is shown that with zero sunk cost, the mono- 
poly is disciplined to efficient pricing at average 
cost by the threat of entry. 

17.8 The task is easier in a city where the net- 
work has many rays so that failing one is not so 
costly; however the fact that inputs are comple- 
mentary remains. 

17.9 For instance, in 1822, the Parisian au- 
thority gave monopoly franchises over boroughs 



to city gas producers against an obligation to 
serve street with minimum demand. 

17.10 Unbridled competition in the UK rail- 
ways lasted until 1865 and, as such, represents 
a test of the natural monopoly. Foreman- Peck 
(1987) shows it had higher cost of service (when 
compared to other European countries) due to 
unnecessary duplications. In retrospect, the 
pro-active continental policies for development 
planning proved to be more economical. 

17.11 Crocker and Masten (1996-01-01) ex- 
plain how the relationship evolved from perfor- 
mance to relational contracts with a dedicated 
commission. The switch to State regulation was 
therefore only a change of scale, not of contrac- 
tual nature. 

17.12 As an alternative, Hotelling (1938) sug- 
gests inheritance or land taxes but they fall into 
the problem of putting incomparable services in 
the same bag. 

17.13 Recall that we deal here with public ser- 
vices not public goods (cf. §16.1.1). 

17.14 This is the case for R&D ventures (cf. 
§12.1.2). 

17.15 Although this issue is more pragmatic 
and carry less ideological weight, it turns out 
to be more important because it is empirically 
more relevant. 

17.16 Even in the best case where the nature of 
the service allows competition among produc- 
ers (e.g., cultural clubs and services not based 
on an immobile facility), the client is only tak- 
ing the membership binary decision; social ob- 
jectives will be fulfilled only if the industry is 
regulated. 

17.17 Boiteux (1956)'s equation (17.2) is iden- 
tical to Ramsey (1927)'s optimal indirect taxa- 
tion scheme. Baumol and Bradford (1970) re- 



late both works to the general theory of welfare 
alluded in §2.3.3. 

17.18 Letting q = [qi) isn denote a bundle of 
quantities, the solution of maxqVK(q) under the 
constraint Wsiq) > F ranges from the perfect 
competition (efficient) outcome q* when F = -co 
(the constraint never binds), to the monopoly 
outcome q M when F = W s {q M ) (this is seen by a 
simple contradiction argument). The case F = 
is generally understood as "second best". 

17.19 Beyond basic cases such as water, energy, 
transportation or telecommunications whose 
justification has been given before, one can add 
preventive health and education as contribut- 
ing to homogenize opportunities within the en- 
tire population (cf. §16.1.1). 

17.20 This setting covers the monopoly as well 
as the perfect competition paradigm. 

17.21 Beware that the pure monopolist tends to 
restrict output; thus, in the case of a benin neg- 
ative externality, the monopolist still produces 
less that the efficient level i.e., as Hotelling 
(1931) noted, he can be the environmentalist's 
best friend. 

17.22 Laffont (1977) refines the analysis by 
showing that technological uncertainty (rele- 
vant information unknown to everyone) does 
not play any role for choosing a policy instru- 
ment as long as the expectations of actors are 
the same. 

17.23 More venal employees will seek greater 
budget in order to get higher salaries, perks, 
esteem, economic power, patronage capacity, 
management leeway. 

17.24 Unless TWTP is extremely large and 
marginal cost rises slowly, 2q* < a, the bureau 
absolute ideal. 

17.25 Unless bureaucrats can rob from the 
budget (in which case the concentration of 



wealth is still undesirable), they tend to spend 
over items that are only imperfect substitutes 
to cash payment. For examples, they provide 
employment or award contracts to friends who 
are not the cheapest alternatives in the mar- 
ket. They thus raise costs which justify greater 
budgets in the future. 

17.26 Against the discretionary use of the bud- 
get, we may mention the control exercised by 
users (seeking maximum output) and employ- 
ees (seeking maximum wage bill). 

17.27 Notice that with average cost pricing, 
each revenue is equal to the attributable cost 
plus a common markup since after simplifica- 
tion we obtain Q = ^r ( c j<7/ + Pi)- 

17.28 The obtainment of extraordinary (eco- 
nomic) profits (n > 0) is equivalent to beating 
the market in terms of rate of return on in- 
vested capital; it attracts new investors in the 
activity. 

17.29 Since the cost of capital r is an average 
over markets with free entry where long-run 
competitive prices are observed, it is very likely 
that p* > r holds true given that the present 
market is monopolized. 

17.30 Analytically, sign (J|) = -R + wL- wK§^ = 
-R + wL + wK^- < -R + wL + w L = -n < because 

<Pj ID 

the MRTS -^ is equal to the price ratio at 
{K* ,L*) and then decreases as K increases. 

17.31 By (17.5), the "eye-drop" is in fact a zero 
profit curve computed with p as the price of cap- 
ital. It is convex because if we take the aver- 
age of two input mixes then the cost is exactly 
the average due to linearity while the revenue 
is more than the average because the revenue 
function is concave in quantity. Hence the aver- 
age mix yields a greater profit i.e., corresponds 
to a greater rate of return. 



17.32 Some people go as far as arguing that 
during the old days of the AT&T monopoly over 
telephone in the US, the price cap was higher 
than the monopoly price. 

17.33 Universal Service was a commercial 
strategy of extensive geographical coverage by 
AT&T to compete against the older telegraph 
service of Western Union. 

17.34 State Aid is permitted for regional de- 
velopment, R&D, environmental protection, re- 
structuring of firms, promoting culture, pre- 
serving heritage and to fight unemployment. 
There exists block exemptions for small and 
medium sized enterprises, training and em- 
ployment. According to Eurostat, State aid has 
been halved between 1994 and 2004 from an 
initial level of around one percentage point of 
GDP (cf. also ). 

18.1 Our effort saves the reader from be- 
ing buried under literally hundred of pages of 
Hamiltonians and useless retelling of optimal 
control theory by economists. 

18.2 We use ' to designate temporal deriva- 
tives. 

18.3 Depending on application, it is called the 
Golden Rule savings rate or the fundamental 
equation of renewable resources. 

18.4 In some later models, population L grows 
at rate £ and the objective is total utility LuiqIL) 
so that we need to subtract £ from r but this 
threaten convergence of the intertemporal ob- 
jective. Others (Cass, Koopmans) have di- 
rectly considered maximizing per-capita utility 
in which case it is obvious that in the steady 
state equation a is replaced by a + i, which 
amounts to add a to the discount rate r in the 
FOC! 

18.5 Technically u is concave. At the individual 
level, this is monetary risk aversion but at the 



aggregate level, this is aversion to income in- 
equality, cf. debate on climate change policies 
where protecting the world for future genera- 
tions comes at the cost of maintaining current 
income inequalities. 

18.6 Another intuitive derivation ask the fol- 
lowing: given current capital k, should we move 
to a different stationary state? Assuming that 
capital can be bought at market rate r, the 
answer is found by maximize current value 
v{k + £,e) under the budget constraint r£ - e = 0. 
The FOC of this standard optimization is that 
the RMS be equal to the price ratio i.e., r = 0(/c); 
the solution is exactly [k,e) = (0,0) when k=k* . 

18.7 More precisely, the easier extraction is, 
the larger the quantity extracted at every in- 
stant. 

18.8 If T = +oo then q t = q* for all t, so that 
total extraction diverges while if T = 0, total ex- 
traction is nil. The equation under study has 
therefore a minimal positive solution for each 
k. 

18.9 In short, he/she has no regards for his peo- 
ple' felicity, an eagerness for ostentatious im- 
mediate consumption and is under the constant 
threat of demise through a coup d'etat. 

18.10 cf. Long (1975) and Sinn (2008). 

18.11 When r is small, 1> 2^r. 

18.12 The fact that T m > T* tends to increase 
output but it is a small effect compared to the 
twice factor. 

18.13 We do not distinguish between ex- 
haustible and renewable resources i.e., we pool 
fisheries, land (for hunting and grazing but not 
toiling) with the underground soil (e.g., oil, gas, 
minerals, water). 

18.14 As shown by Nobel laureate Ostrom 
(2003), the management of commons is often 



successfully organized by the involved commu- 
nity. 

18.15 Recall that DRS is equivalent to decreas- 
ing average return or marginal return lesser 
than average return. 

18.16 This does not mean that all resources 
are over exploited; yet it can be shown that 
ta > ib => I a > l A i.e., the better resource is over 
exploited. 

18.17 Such a phenomenon is frequently ob- 
served in developing economies trying to har- 
ness efficiency gains through the privatization 
of commons. 

18.18 Such a use of taxes is known as repric- 
ing or the double dividend in the field of public 
finance. 

18.19 Kotchen and Salant (2009) neverthe- 
less show that if the tragedy is severe, a mild 
tax can reduce exploitation and increase profits 
at the same time although the device will fall 
short of restoring full efficiency. See also §5.1.3 
where we tackle the effect of a tax over the en- 
tire industry. 

18.20 The fact that in the first round, the total 
distributed exceeded the aggregate output was 
a plain mistake. 

18.21 Minerals are geologically speaking re- 
newable resources but we call them exhaustible 
because their growth is unnoticeable at the 
scale of human activity. 

18.22 The 200 miles Exclusive Economic Zone 
zone was created in the 1980s when satellite 
and aviation technology enabled governments 
to enforce their exclusivity over the said area. 

18.23 Typically newborns minus deaths in the 
period. 

18.24 It fits quite well the data for many wild 
species over a short period of time. 



18.25 In ecology, one speaks of positive recruit- 
ment or more births than deaths per period of 
time. 

18.26 Set y = Ilk and use y = -klk 2 to get 
The equation is now ^ = (l-^jr 

<=> /• = ^r~r = n{ Z~ l) whose solution is y - 1 oc 



k = - I 2. The equation is now -y r , 

y _ ain(y-l) 
y-\ dt 

p-rt y>-l _ p -rt 

18.27 For q > q, the bell curve p{k)k- q is en- 
tirely below the axis i.e., the rate of change is 
5 = k < meanwhile k t > 0, so that population is 
ever decreasing. 

18.28 This aggregate formula is valid at the in- 
dividual firm level under the quite acceptable 
proviso that all firms are equally efficient i.e., 
qi = kLt for any firm i. 

18.29 The open access equilibrium for a renew- 
able biomass is called the bionomic equilibrium. 



(l-7?)p+l+y 8p7?+((l-7/)p+l) 2 
4p 



18.30 Analytically, k n ■■ 

1 o 8p (l + rj) (l - p) < o p > 1. 

18.31 Note that the static oligopoly with n 
firms correspond to dynamic extraction by a 
monopoly with discount factor solving k n = k n . 

18.32 This feature leads voters to see the tax 
as a disguised mean to increase general taxa- 
tion and government revenue. 

19.1 It is called risk-free because the securities 
emitted by governments of countries who never 
failed to pay their obligations are considered to 
be without risk of default (cf. Sovereign Rat- 
ings). 

19.2 For a lM€loan at 5% over 20 years, the 
difference between the approximate and exact 
rates would be 61€ per month. 

19.3 The support of H is [x;x] where x is the 
largest solution of H{x) = and x is the smallest 
solution of H{x) = 1. 



19.4 Bounds are removed when they do not 
convey useful information for the computation. 

19.5 Let ( = E fc >! i, then | = £ fc > 2 js=(-j, thus 
f=l. 

19.6 Let f = E fcal ^, then f = E fcal ^ + 
|Lfc>2 f^r = 1 + \S by the result of footnote 26.3, 
thus £, = 2. 

19.7 This is Jensen inequality. To prove 
it, observe that the concavity of u is equiv- 
alent to the property u{x) x ~_f o Xo) < u'[x ) (the 
chord is flatter than the tangent at x ); tak- 
ing x = E [x] and integrating we obtain E [u{x]] < 

E[u[Xq) + ll'{Xo){X- Xq)] = U(Xq) + u'(jco)E[Jc - Xq] = 

u(E[x]). 

19.8 Observe that the function p completely 
characterizes the preferences since it enables 
to recover the utility function u up to an affine 
transformation (cf. proof). 

19.9 The same problem plagues the concept 
of Pareto dominance in that both define only 
a partial order over the objects one would like 
to rank. This is why public economics rely on 
welfare functions to aggregate preferences and 
build a social preference out of individual ones. 

19.10 The original authors do not put a great 
emphasis on naming their proposal. Hart 
(2010) calls them economic and operational 
which does not convey an intuitive meaning. 
Our naming choice reflects the intellectual debt 
we owe to these forerunners of risk studies. 

19.11 An additional necessary condition is that 
no gamble can be always accepted by one of 
the above people. It is equivalent to require 
utility unboundedly negative when approach- 
ing bankruptcy. 

19.12 This innocuous hypothesis is necessary 
for the technical analysis to go through. 



19.13 We drop the ~ from random variables in 
the footnotes for clarity. The function /(a) = 
1 - E[e" a £] starts at for a = 0, then increases 
since /'(0) = E[g] > but tends to some negative 
value for a large because the gamble achieves 
a loss with positive probability. It must have 
another zero p„. Another definition would be 
lim e ^ sup{as.£.|/(a)| < e}. This way the riskiness 
of a sure win is zero while that of a sure loss is 
infinity. 

19.14 Being ARRA, the richer they are the 
bolder they act as p' < 0; but IRRA limits the 
force of this effect since -p/x< p' '. CRRAs are 
those with maximum such effect. 

19.15 Let (p be the common riskiness, we 
have 1 = E[e-8'<P] x E[e _ s' / «'] = E[eS l <P e -8'i<P] = 
E[e-&+* , > / «»]. 

19.16 The function f(a) = E[log[l + ag]] which 
starts at for a = 0, then increases as /'(0) = 
E[g] > and goes to -oo for a close to II L where 
L is the maximal loss of the gamble. It must 
have another zero and </>„ is its inverse. 

19.17 More formally, <p g is the wealth required 
but also sufficient to avoid bankruptcy when 
playing repetitively the gamble. Notice that 
bankruptcy must be understood as losing all 
the money one has allowed himself to invest 
into risky prospects i.e., keeping enough to at- 
tend living expenses. 

19.18 The weight onto negative values being 
so small, the a parameter of footnote 26.3 must 
tend to II L in order to push the log towards -oo. 

19.19 This is to be expected from a measure 
named after an insurance specialist. 

19.20 This is to be expected since Bernoulli was 
involved with gambling in the first place. 

19.21 The previous reasoning is false for a fi- 
nancial firm since it always takes care to di- 



versify its assets across markets and activities 
in order that a loss somewhere be always com- 
pensated by a gain elsewhere; also it tries to 
build large portfolio of similar clients in order 
to smooth out the effect of chance (or bad luck) 
down to the mean of the underlying risk. 

19.22 We assume that the resulting profit is 
non negative. 

19.23 If Y = f{X) where /' < 0, then > [X - 
X )(Y - Y ) whether X < {>)X because the sec- 
ond term is always of the opposite sign of the 
first. Integrating conserves the inequality. 

20.1 There is a subtle difference in the litera- 
ture regarding the action of the agent: it can 
be observable by the principal only or verifiable 
by the judge (on top of being observable); this 
distinction is sometimes referred to as soft vs. 
hard evidence. 

20.2 What is meant here is that although it 
would be technically possible to perform such a 
monitoring, it would be so costly that it is better 
not undertaken. 

20.3 Notice that minimal effort can take sev- 
eral meanings. When the job is painful it is 
presence at the work place that will determine 
this minimum level (absence is easily verified). 
If the agent loves his work, the minimum level 
is in fact the threshold above which he refuses 
to sacrifice family life without additional com- 
pensation. 

20.4 Although the equation to solve is still 
q'u' = c', it is evaluated at a different income 
level; hence wealth effects could alter the first 
best effort but it remains first-best conditional 
on 7r, just like e* was efficient conditional on u. 

20.5 The market price can be normalized to 
unity by scaling adequately the output unit. 

20.6 The case of yardstick evaluation is similar. 
One replaces e 2 by the fixed objective q to obtain 



the same FOC of effort. Setting q = j3h{0) makes 
e = q the solution of the FOC. The only differ- 
ence to derive the cost formula is that l//i(0) 2 is 
half the previous value because H is the law of 
ei only. 

20.7 Lazear and Rosen (1981) nevertheless 
show that if the worker displays decreasing ab- 
solute risk aversion, the tournament may be 
the preferred alternative. 

20.8 W.l.o.g. that the same agent is in charge 
of the activity whose productivity is one while 
the other looks after the activity whose produc- 
tivity is y. This can be checked from the final 
profit riteam formula. 

20.9 Indeed there is no point to relate his pay 
to the result of line (5 since this would only add 
a risk burden given that he does not work on 
that line. 

20.10 W.l.o.g. a = 1, n ind - Ilteam is propor- 
tional to 2p 2 -(y- l) 2 y + 2p(y 2 + l-y). The mean- 
ingful solution to the quadratic equation in p 
is is a convex increasing function /(y) satisfy- 
ing /(l) = and asymptotic from above to L j-. 
Hence n team > n ind o p < /(y) which we approx- 
imate in the text by the asymptote. 

20.11 Absent from the model is the discount 
factor for future earning; clearly, the more im- 
patient (eager to live by the day) is the em- 
ployee, the lesser the implicit incentives. 

20.12 The rest of the time can be dedicated to 
shirk or pursuing a personal objective i.e., dif- 
ferent from that of the principal. 

20.13 We assume here that the principal is risk 
neutral i.e., cares for profit only; otherwise his 
objective would be the utility of ex-post profit. 

20.14 Given that u and n are concave, n J^Jf is 
increasing so that equation (20.12) has at most 
one solution in w\ as the LHS diminishes with 



q this solution increases with q. We might how- 
ever have co* (q) = over some interval of low q's 
and cj*(q) = q over some interval of large q's. 

20.15 We can safely assume that the IR con- 
straint is binding (A > 0) whenever c is convex, q 
is bounded and conceivable effort is unbounded, 
for under these hypothesis U{q,e) diverges to 
-oo as effort increases. 

20.16 As in footnote 26.3, a higher q diminishes 
the LHS of (20.11) but raises the RHS so that 
the solution must increase. Notice that this 
added hypothesis is the MLRPseen in §19.4. 
Indeed, -f = ^^ implies that for e < e' we have 
ln(|gf ) = f e e '^{x,q)dx, thus the MLRP is ei- 
ther that the LHS is increasing in q or that -f 
is increasing in q. 

20.17 The equation determining \i is fn{q- 
to(q))h e {e,q)dq = -[i{f u{io{q))h ee {e,q)dq- c"(<?)| 
where the term in braces is the second order 
condition of utility maximization for the agent, 
which is necessarily non positive. 

20.18 This reversal is not as extraordinary as 
it may seem: having provided effort the agent 
has become indispensable to the principal and 
might have acquire the bargaining power. 

21.1 If two coins have the same face value but 
are made from metals of unequal value, the 
cheaper will tend to drive the other out of circu- 
lation; the more valuable coin will be hoarded 
or used for foreign exchange instead of for do- 
mestic transactions. Although British authors 
refer to this fact as Gresham Law, Nobel prize 
winner Robert Mundell recalls us that ancient 
greeks philosophers suggested the law which 
was clearly enunciated in the middle ages by 
Oresme and Copernicus. 

21.2 Certification requires a State (justice and 
police) strong enough to deter impersonators 
from defrauding clients and the very certifiers 



from either abusing their dominant position or 
selling false certificates. Examples still exists 
with the auditors in the Enron case or the credit 
rating agencies in the subprime meltdown. 



21.3 Letting c = 



2(l-A)(l-w) 2 
(2-A) 2 



it is possible to 



check that the solutions to equation n A = n B are 
c = c where both profits are nil (which defines 
Ao implicitly) and c = (1 - A) 2 c which defines Ai 
implicitly. 

21.4 This idea is independent but reinforces 
the traditional motive of education which is to 
increase one's own ability. 

21.5 The net present value of future payments 
of 1€ per period starting tomorrow is II r. 

21.6 This is obviously an extreme assumption 
only used for its simplicity. 

21.7 An exhaustive formal reference is Laffont 
and Tirole (1993). 

21.8 We follow the model of agency relationship 
seen in §20.1. 

21.9 As in §4.3.2 for price discrimination, ex- 
cluding inefficient agents in order to improve 
the performance of efficient ones is possible but 
if V is large this is never optimal. 

21.10 This technique can be illustrated with 
the following simple question: who is the tallest 
Dutch? If we lack a Dutch database we can still 
search a European database and use our intu- 
ition that Dutch are the tallest people of Eu- 
rope. Once we encounter the tallest European, 
we only need to check that he/she is a Dutch 
citizen. 

21.11 The general model requires only that 
Phi) - P;(.) > is valid for the entire domain. 

21.12 In the general case, the IC constraints 

are f q " [P h - P,) > u h - u x > f q ' [P h - Pi). 

21.13 In the general case, the equation is (1 - 

a){Pi - c) = a{Ph - Pi) o Pi = c = (1 - a)c + aP^ > c. 



21.14 The proper demonstration uses the fact 
that types vary continuously and plays with the 
IC condition as follows: for any v, we must 
have u [9, q[9)) > u [9, q{y)) o 9U[q{9)) - T [q{9)) > 
9U{q{v)) - T{q{v)) => dU'q' - T'q' > by letting 
v tend to 9. Inverting the roles of the two 
variables, we obtain vU'q' - T'q' < 0, hence the 
optimal scheme q{9) must satisfy 9U'{q(9)) = 
T'(q(9)). 

21.15 Properly speaking, differentiate (21.13) 
and use the SOC associated to (21.13) to prove 
that q'{.) > 0. 

21.16 If the solution to the FOC is not in- 
creasing, then some "ironing" is needed to force 
non decreasing-ness (cf. Fudenberg and Tirole 
(1991), ch. 7)). 

21.17 If k > ^y^, the capacity is large enough 
to meet demand at the monopoly price p M {a) = 
g ^^\ the announcement a = k+ ^^ > a does the 
trick. If demand is so large that there is conges- 
tion at the monopoly price, the firm simply tells 
the truth and sells all her capacity at the max- 
imum price Pkia)- 

21.18 Formally we also have to specify a zero 
subsidy if the price is set below marginal cost in 
order to force loss should the firm ever choose to 
price that way. 

21.19 In the present context, A is an ad-hoc pa- 
rameter while in the general theory, its value is 
obtained (computed) at the equilibrium. Alter- 
natively, one can assume that the regulator has 
a strict preference for consumer surplus over 
firm rent. 

21.20 Note that (ICh) is satisfied since it reads 
> 6[qh - qi) and we saw that q h < q* h < q\ = qi- 

21.21 In a continuous setting, the opportunity 
value to the agent of reducing the item cost by 
e is d{e) where d is increasing convex. Let e* be 



the efficient effort solving 1 = d and ft = e*-d{e*) 
be the maximal technology saving. 

21.22 He won't however do more because he 
supports the financial cost d{e). 

21.23 By giving more risk sharing to bad types 
an insurer improves efficiency thus he can im- 
prove its per-capita profit over bad types. Imi- 
tation by competing insurers yields the result. 

22.1 The word auction derives from the latin 
"auctus" which means increase. British authors 
refer to the traditional Roman auction as the 
English auction; we have not been able to find 
an explanation for this bizarre praxis avoided 
by the landmark article of Vickrey (1961). 

22.2 For instance, professional auctioneers who 
used the candle auction were granted a mono- 
poly by Henri II of France in 1556. 

22.3 Ask prices (supply) are ranked lowest to 
highest while bids (demand) are ranked high- 
est to lowest. The demand and supply profiles 
so generated are then matched to determine an 
equilibrium price and an amount of trade. 

22.4 As argued in §16.4.3 & §3.3.3, the finan- 
cial motive is more reaslistic. 

22.5 Since most participants to these lotter- 
ies have limited liability, the price asked by the 
government ought to be nearly zero and in any 
case far from the true highest value; as a con- 
sequence, the government is relinquishing part 
of its rent in favor of a single firm or agent and 
will be forced to resort to distortionary taxes to 
make up for the difference. This constitutes a 
first drawback of this method. 

22.6 If the price in a Roman auction increases 
by very small amounts, the last bidder ends up 
paying the second highest bid plus a very small 
amount just as in the Japanese version. 

22.7 Attrition is a second-price format whereas 



the (standard) all-pay auction is a first-price 
format. Indeed, the highest bidder need only 
expand a little more than his last opponent. 
The terminology derives from warfare and was 
first theorized in biological competition. 

22.8 We consider a Nash equilibrium of the 
auction game where bidders use optimal strate- 
gies. 

22.9 The payment is not necessarily linked to 
being awarded the item; in an all-pay auction 
for instance, all bidders pay their bid. 

22.10 Given his private value v, the bidder 
builds a bidding strategy a v that is optimal in 
this auction. Since he may learn many different 
values, he ends up with a long list of strategies. 
However, upon learning v, a v is better (at least 
not worse) than ap while upon learning v, ao 
dominates o~ v . This is another instance of the 
revelation principle. 

22.11 Check that with a starting price s, the 
expected revenue is 3 3 °~ L s s and is maximum for 

5=15. 

22.12 A monopoly sells many units while the 
auctioneer has typically a single item for sale. 

22.13 For most statistical distributions, the 
hazard rate ^^o * s increasing so that R m t) 
is also increasing. Furthermore, we have con- 
structed H from a regular demand function ad- 
mitting a decreasing marginal revenue. 

22.14 Recall that profit is n = W s - v . 

22.15 Since we assumed R m j increasing, a 
higher value vt leads to a higher marginal rev- 
enue, thus a higher or equal probability of win- 
ning; this means that the necessary condition 
(22.2) <p' > is satisfied. The general case is 
treated in appendix. 

22.16 Notice that the reasoning we've used 
here was identical to that shown in Table 22.1. 



22.17 The public policy implication is that gov- 
ernments should only aim at providing educa- 
tion to those who cannot pay for it (on top of its 
regalian missions). This way, the most brilliant 
and hard-working people will successfully bid 
for the control the scarce economic resources, 
generate the highest added value and in the 
mean time grab for themselves a fair reward. 

23.1 Likewise, the entrepreneur "does the right 
thing" whereas a (good) manager "does things 
right". As wittily stated by Marshall (1907) in 
a related matter, "the government could print a 
good edition of Shakespeare's works, but it could 
not get them written". 

23.2 Syverson (2010) says that "productivity 
is quite literally a matter of survival for busi- 
nesses". Capital accumulation and the expan- 
sion of the labor force matter also but are less 
desicive. 

23.3 Capital is of the circulating kind i.e., it 
completely depreciate in the process of value 
creation. 

23.4 If we choose to express profit in terminal 
value then the revenue is simply R but the eco- 
nomic cost is then (1 + r ) k to account for the op- 
portunity cost of time. 

23.5 We also assume R m {0) > 1 to guarantee 
that k* is positive, thereby avoiding trivialities. 

23.6 It is customary in the literature to use the 
letter "u" as in utility to denote the objective 
function of the agent. 

23.7 From R m {k a ) = ^, we deduce ^f = 

(izfe < ° and d ~t = W C*m - « < ° since R m > 1 
over the range k < k i.e., i.e., investment and 

agency value decrease with a the outsider par- 
ticipation. 

23.8 Since we assumed 1 < R m (0) =lim ^, it 
must be true that \>lim JrjH. 

a _ 1 K(K a ) 



23.9 Indeed, g = ^^^ = %*■ {} 0^ since ^ of a DRS technology is greater than its 
Rm = jz^ at k a ; now A; a being the maximizer of marginal R m {k), thus i = ^^ > R m implies that 
(1 - a)R- k, the latter expression is positive at 
k a , hence ff < 0. 

23.10 At best for her, the participation con- 
straint of the investor is binding i.e., F= aR{k a ) 
so that her final profit is at best n a - F = (1 - 
a)R{k a )-k a whose derivative is -R{k a ) < by the 
envelope theorem (k a is an optimum). Hence 
her payoff is maximum at a. 

23.11 What really matters is that the firm 
has not enough cash or marketable securities 
or risk-free debt to cover the cost of the project. 

23.12 Regarding existence of V no and V gQ it is 
enough to assume an infinite tail for the sta- 
tistical distribution of x. This implies that 
E[x|go] = E[x\x < (1 + r)V go ] is an increasing con- 
cave function of V go , thus the equation v-rk = 



E[x|Jc< [l + r)v] has a unique solution in v so that 
V go is uniquely determined; it is increasing in 
the project size k. Now, V no = E[x\x > (1 + r)V go \ is 
also uniquely determined. 

23.13 a < a o \/fi{p. + 4a 2 ) - fi < 2a 2 ^p,la 2 = 

2\/\l0 2 

o 4p,a 2 > ( x/M(M + 4o " 2 ) - m) 



o 2^y/ii{^, + 4a 2 ) > /i(/i + 4a 2 ) + \i 
o V 'fiifj, + 4a 2 ) > |U o nifi + 4a 2 



2 



1 > pR m {k) thus pR m {k) - 1 < for p £ [0;p[ and 

therefore -£ > / +o ° [pR m {k) - l) dH{p) = -£-. 
dk dk 

23.17 Alternatively, the random component 
could be the cash-flow generated by current as- 
sets out of which the entrepreneur will finance 
part of her new investment. 

23.18 This presentation is inspired by 
Berkovitch and Kim (1990) and Yossi Spiegel's 
teaching notes. 

23.19 She can emit new equity paying a fixed 
dividend to avoid moral hazard. 

23.20 Conditional on the cash flow realization, 
the optimal investment is a corner solution at 
x. 

23.21 Assuming that the law H{k,x) admits a 
density h(k,x), the change from k to k satisfies 
the monotone likelihood ratio property (MLRP) 

if * > k =*■ SfS / in x - The family of distribu- 
tions h{k,.) is said to satisfy MLRP if the previ- 
ous property is true for all parameter values. 

23.22 As shown in §19.4, the MLRP property 
implies F SD (first 2 o rder stochastic dominance), 



9 H; < H k and it is easy to check that the expec- 
4pa - 2/i 

tatiqn of an increasing function increases with 
> n o p + 4a L > jx o 4a > ! 



23.14 An example of different preferences 
would be Italian football teams which seem 
more interested in avoiding an overwhelming 
defeat than clinching a victory. 

23.15 Football in major tournaments is rather 
like the war of attrition where each team waits 
for an error or a risk taking by the other side 
to counter-attack and secure an advantage (cf. 
§7.4.2). 

23.16 The debt-financed entrepreneur earns 
marginally more. Indeed, the average return 



a FSD change. 

23.23 Our claim is the contraposition of the 
following property: for any k < k* ,n{(p y ,k) > 
n{(p d ,k). To prove the property, we use MLRP: 
E[ Td |r] = E[ T |r] ^> E[ Td \k] > E[ r |*]. This 
says that every renegotiation offer <p acceptable 
after j d is also acceptable after y (both condi- 
tional on investment k), hence the maximum of 
n for renegotiation following j cannot be less 
than the maximum of n for renegotiation fol- 
lowing jd- 

23.24 The notion is second order stochastic 



dominance. 

23.25 Since that R(x) is concave, -R is convex, 
it is thus enough to apply the above characteri- 
zation of riskiness to see that riskier applicants 
repay less on expectation. 

23.26 Draw n[x) = max{x- (1 + r)d,0} as a func- 
tion of x to check the effect of increasing r. 

23.27 This critique is not given prominent 
space because the model does not generalize be- 
yond the success-failure model used by these 
authors. 

23.28 Other typical but also imperfect mea- 
sures to align the objective of managers with 
that of shareholders are salary incentives (par- 
ticipation to profits) and stock-options. 

23.29 This is the famous irrelevance theorem 
of Modigliani and Miller (1958). 

23.30 The remnant could also be more brutally 
diverted into perquisites i.e., an immediate con- 
sumption of no value for the firm. 

23.31 Instead of an additive uncertainty, we 
could equally use a multiplicative market price 
uncertainty like in previous models at the cost 
of heavier formulas. 

23.32 We assume that H is one of the many 
distributions whose hazard rate is increasing. 
Since the LHS of the FOC is increasing with k 
while the RHS is decreasing, there is indeed a 
unique solution k. 

24.1 Notice that there shouldn't be too many 
of them because in a world with a million stan- 
dards, there is no "real" standard! 

24.2 Have you ever tried to remove the motor 
of your car and replace it by a different one ? 

24.3 One speaks of "one-stop-shopping" when 
consumers enjoy a discount for purchasing mul- 
tiple services from a single supplier (cf. §5.3.3 
& §24.1). 



24.4 Proof: Lowering p AB below p* A + p B is akin 
to raising p B above p B but since the latter is 
at its optimal level, the loss in sales (of good 
B) is exactly compensated by the increased rev- 
enues (on good B). However, there is a spillover 
effect in that some people who would cease to 
buy good B switch to the bundle, thus raising 
profits. 

24.5 For example, in the context of bundling 
computer programs it does not seem farfetched 
to assume that selling components separately 
would require substantial extra programming 
costs in order to guarantee compatibility of the 
components with older softwares, costs that 
could be avoided if the new programs are bun- 
dled. 

24.6 If the consumers leisure budget is exoge- 
nous, the long term equilibrium determines the 
number of active firms (or software) as the ratio 
of revenues to development cost of one software. 

24.7 Game theory has a branch studying re- 
peated interaction; it has shown that coopera- 
tion can emerge in a non cooperative environ- 
ment. 

24.8 Historically a circus wagon carrying a 
musical band; today a current or fashionable 
trend. 

24.9 Their model reframe previous works of 
Rohlfs (1974) and Farrell and Saloner (1985). 

24.10 Solving for *g = 0, q° = |(1 -s); the maxi- 
mal sustainable price is c° = \ {s+ l) 2 . 

24.11 We have W'{b) = s- c- \b{3b + 2s-4) and 

* _ 2-s+\A(s+2)+4-6c 



24.12 n M {c) = i-"V^+2) + i-3g 

24.13 An externality is pecuniary if its trans- 
mission vector is the price mechanism, other- 
wise it is technological. 

24. 14 If innovations in the modern sector can 



be protected by IPRs such as patents or strate- 
gic entry barriers, then monopoly ensue (cf. 
§10.2). If imitation is easy then competition 
involves equal cost thus leads to zero producer 
surplus which will bar a challenger from enter- 
ing given the presence of the fixed cost (cf. §6.1). 

24.15 If there were n varieties, instead of a con- 
tinuum, we would have to replace L by Lin from 
this point on. 

24.16 He can only grab the surplus of the 
weakest sector in each sector. 

24.17 In this simple setting, there is a multi- 
plicity of equilibria and the one that actually 
takes place depends on considerations outside 
the model. 

24.18 The same goes for boats since the conduc- 
tor steers with his right arm (cf. Venetian pad- 
dlers), the tiller is on the right side and to pro- 
tect it when crossing incoming traffic one has to 
keep on the right side. 

24.19 It is also argued, with less force, that the 
revolutionary Robespierre countered the papal 
advice on political grounds with a mandatory 
opposite custom. 

24.20 In this market Microsoft's products 
achieved the best ratings and quickly reached 
a 80% share; somehow "Mac" users taught MS 
how to make friendlier products for the PC 
world. 

24.21 CNET reported cost estimates for the 
Xbox from 320 to 400$, the initial public price 
being 400$ later slashed down to 180$; it is 
also noticeable that a assembly plant in Hun- 
gary was shut down to favor cheaper production 
from China. 

24.22 Analytically, letting z denote consump- 
tion of positional goods, the net utility of a con- 
ceited person can be stated as u(z) - 6{z - z) 



where u is the private utility function and z is 
the average spending of the consumer's friends 
or relatives. Then the optimal individual con- 
sumption solves u' = -8 while the socially opti- 
mal one solves u' = 0. If positional goods display 
diminishing returns (i.e., u" < 0), the result ob- 
tains. 

24.23 To avoid corner solutions, it is neces- 
sary to assume an small enough (cf. Grilo et al. 
(2001)). 

24.24 Although long time clients tend to get 
served without delay, new ones (at the margi- 
nal) are often being rationed. 

24.25 In the past, passengers rights were inex- 
istent and no airline had an incentive to com- 
pete with others by increasing this side of cus- 
tomer service; thus the equilibrium level was 
zero i.e., upon late arrival, you could stay on 
the ground without any compensation when- 
ever overbooking generated an excess demand 
at boarding time. In Europe the 2005 directive 
forces airlines to pay damages to passengers 
suffering refusal to board. It is now cheaper for 
them to auction off the few missing seats. 

24.26 Disney has maintained a reputation of 
releasing video or DVD of their old movies in 
the consumer market (not the rental one) for 6 
months only with a promise to stop selling it 
for 5 full years, a time long enough for kids to 
become uninterested by the feature. 

24.27 Equivalently, if some immediate second- 
hand market was put in place to permit ex- 
change among all potential clients. 

24.28 A more realistic model would describe 
the hierarchical pyramid model where the prob- 
ability of escalating is n. 

25.1 This is close to the non-rivalry prop- 
erty of public goods but the property of non- 
excludability is not true here since connection 



to the network is controlled by the operator and 
can therefore be conditioned to the payment of 
a subscription. Exception are radio and TV if 
they are transmitted by air waves without en- 
cryption. The possibility to exclude entrance at 
school or hospital may seem outrageous to most 
but is feasible at low cost; hence neither of these 
services qualify as a public good! 

25.2 An alternative is a choice between two 
things, so that if one is taken, the other must 
be left. 

25.3 Some deliberately create queues to in- 
crease the word of mouth effect and attract 
even more buyers. 

25.4 Under the now ubiquitous ADSL or cable 
connection, traffic is free and we only suffer the 
opportunity cost of time. 

25.5 It would take two-part pricing to re- 
store efficiency; by setting p* consumers are 
led to demand the welfare maximizing quantity 
which can then be recuperated through a sub- 
scription. 

25.6 The name comes from the analogy with 
a bicycle wheel, which consists of a number of 
spokes jutting outward from a central hub. 

25.7 Consumption of electricity is measured 
in Watts (power) per hour, while the instanta- 
neous demand is measured in Watts. A Giga 
Watt (GW) is a billion Watts, a Tera Watt (TW) 
is a thousand GW. 

25.8 The effect of 1C temperature variation 
on French demand are nearly 2% in winter and 
1% in summer. Haziness effect from clarity to 
obscurity can boost demand by up to 6%. 

25.9 In recent years, as much as 21% of the 
French park was unavailable due to mainte- 
nance. Indeed, the intensive use of nuclear 
power plants requires tighter security mea- 



sures, thus longer and more frequent periods 
of maintenance. 

25.10 In France however, nuclear power ac- 
counts for 54% of the installed power and 77% 
of generation; recent studies puts the share of 
capital cost around two thirds of total costs for 
a nuclear power station. 

25.11 Notice that on days of extremely high in- 
ternal demand, exports are reduced by a factor 
2 or 3 using market mechanisms. 

25.12 It comes as no surprise that this analy- 
sis was performed by an engineer from an elec- 
tric utility, the French monopoly EDF (cf. Dreze 
(1964) for details). 

25.13 Similar ideas were considered in the UK 
but France had a higher share of industrial 
clients with whom price discrimination was ne- 
gotiable; as we already commented, price dis- 
crimination with residential clients was almost 
unanimously opposed by political forces. This is 
why peak load pricing took more time to make 
its way in the UK. 

25.14 In line with our findings on two-part tar- 
iffs, the yearly subscription is set at a higher 
level (190€ vs. 120€ ); there is also a higher in- 
stallation cost for the second (night) meter. 

25.15 This is so because the peak price is diffi- 
cult to change on short notice. 

25.16 In the last 5 years, EDF never declared 
more than 21 red days because it is prudent to 
keep one in reserve for an unexpected event. 

25.17 The elasticity of surface with respect 
to radius is 2 since for a radius R, surface is 
S = 2nR 2 , thus dS = 2nR and e = |^| = fff = 2. 
This means that a one percent increase in city 
radius generates a 2% increase in city surface. 

25.18 These areas cluster 182 millions inhabi- 
tants or 62% of the total US population in 2003 



which was 291 millions (income and popula- 
tion data from the US Bureau of Economic Ac- 
counts). 

25.19 The US Federal Highway Administra- 
tion (FHWA) estimates the remaining causes to 
be accidents (25%), works (15%), bad weather 
(10%) and poor signaling (5%). 

25.20 It is enough to narrow a 2 lanes street 
into one lane over 20 meters to generate a strict 
one lane street in terms of traffic fluidity. 

25.21 In french, but the instructive table 12 
comparing Paris and London in the conclusion 
is easy to understand. 

25.22 Competitive pressure means a poten- 
tially unlimited pool of road users ready to pay 
a price p to use the road. This turns the WTP 
function flat at the level p thus equating the 
welfare and monopoly optimal choices. 

25.23 The fact that most public infrastructure 
are financed from general taxation turns rev- 
enues from congestion pricing into manna that 
many see fit to be invested into capacity expan- 
sion. This is obviously unwarranted for these 
revenue should only pay for current mainte- 
nance and the interest charge on the debt con- 
tracted out at construction time. 

25.24 The fact that capacity is measured by 
lanes which is an integer creates a tuning prob- 
lem that is alleviated by pooling road geograph- 
ically (cross-subsidization) or temporally (be- 
cause traffic tend to grow). 

25.25 In our simple setting where demand is 



normalized to unity, these are the same. 

25.26 Notice however that in such tight condi- 
tions, a sudden use of breaks can create a chain 
reaction that can completely stop traffic. 

25.27 This is in fact the excess over the WTP 
for his next best alternative. Since the latter 
cannot be estimated, it is normalized to zero. 

26.1 The assumption v > known as the "gap 
case" is necessary to avoid running into techni- 
cal difficulties. 

26.2 Choosing s > 1 implies that z > 1 - 8 must 
hold and indeed it holds at the proposed solu- 
tion. 

26.3 Beware that this calculation is done while 
neglecting the cost of installing the service ca- 
pacity. 

26.4 The general treatment (n > 3) involves 
separating the coefficients of q_i and c i+1 from 
the other ones. Parameter j is the entry {i, i) of 
A' 1 , while 6 is twice the entry {i, i - 1). Studying 
the equation Ax A' 1 = Id, one finds 2y-8l2 = 1. 

26.5 Initially, we have /(0 + ) < and /'((T) > 
but since / is continuously differentiate, the 
two expressions converges one to the other. 

26.6 Take y = gz where g is smooth positive 
and admissible, we get = f gz 2 , thus z = f x - f± 
is uniformly nil. This is similar to stating that 
< y, z > = for any y i.e., z is in the orthogonal of 
L 2 which is the zero singleton since L 2 is its own 
dual. 
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